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5 PRIMATE, PARTICULARLY HUMAN, VOMERONASAL-LIKE RECEPTOR 

FIFLD OF THE INVENTION 

This invention relates to a novel primate vomeronasal-like receptor, 
particularly a human vomeronasal-like receptor (hVLRl), which is homologous to 
10 putative rat and mouse pheromone receptors, and allelic variants thereof. This 
invention further relates to nucleic acids encoding the primate vomeronasal-like 
receptor proteins. The invention also relates to a method for producing primate 
vomeronasal-like receptor, a method for detecting expression of these receptors, and 
screening assays for hVLRl agonists and antagonists. 

15 

BACKGROUND OF THE INVENTION 

The olfactory system provides sensory information about the chemical 
composition of the external world. In mammals, olfactory chemoreception initiates at 
the level of sensory neurons that are located in the main olfactory epithelium (MOE) 

20 and the epithelium of the vomeronasal organ (VNO). The MOE mediates the 

detection of volatile odorants. The VNO mediates mainly the detection of nonvolatile 
odorants, such as pheromones. These are chemical signals that provide information 
about gender, dominance, and reproductive status between individuals of the same 
species (Sorensen, Chem. Sens. 21:245-256, 1996). Pheromones elicit in the 

25 recipients innate and stereotyped reproductive and social behaviors, along with 
profound neuroendocrine and physiological changes. 

In mammals, the VNO resides in a blind-ended pouch within the 
septum of the nose. Axonal projections from the VNO converge to form the 
vomeronasal nerve and reach target cells within the accessory olfactory bulb. The 

30 VNO is exclusively connected to specialized centers of the limbic system, including 
the vomeronasal amygdala, the bed nucleus of the stria terminalis, and specific nuclei 
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of the ventromedial hypothalamus involved in reproduction and aggression (Herrada 
et al, Cell 90:763-773, 1997). Pheromones activate the VNO which results in 
behavioral and endocrine responses that do not involve higher cognitive centers of the 
brain (Dulac and Axel, Cell 83:195-206, 1995). 

5 It is now well established that a major proportion (over 70%) of the 

human odorant receptor repertoire consists of pseudogenes (Rouquier, S. et al. Nat. 
Genet. 18, 243-250, 1998; Rouquier, S.et al., Proc. Natl. Acad. Sci. USA 97, 
2870-2874, 2000; Mombaerts, P. Curr. Opin. Genet. Dev. 9, 315-320, 1999), 
reflecting perhaps a human's decreased dependence on olfactory cues compared to 

1 0 other mammals. However, the existence of pheromones and a functional vomeronasal 
system in humans remains controversial (Preti, G. & Wysocki, C.J. Advances in 
Chemical Signals in Vertebrates (ed. Johnston, R.E.) 315-331 (Plenum Press, New 
York, 1999)). Despite an undisputed presence of VNO-like structure during early 
human embryogenesis, it regresses after birth to become vestigial in adults 

15 (Humphrey, T. J. Comp. Neurol. 73, 431-468, 1940; Stensaas, L.J. et al., J. Steroid 
Biochem. Mol. Biol. 39, 553-560, 1991). However, it is inappropriate to consider the 
VNO as the exclusive site of pheromone detection (Johnston, R.E. Vol.855 (ed. 
Murphy, C.) 333-348 (Annals of the New York Academy of Sciences, New York, 
1998), because some mammals such as the rabbit (Hudson, R. & Distel, H.. Physiol. 

20 Behav. 37, 123-128,1986). and the pig (Domes, K.M., et al, B.P. Brain Behav. Evol. 
49, 53-62, 1997) are able to detect pheromones via the main olfactory system; 
furthermore, fish lack a VNO but express V2R homologs within their olfactory 
epithelium (Naito, T. et al., Proc. Natl. Acad. Sci. USA 95, 5178-5181, 1998; Cao, Y. 
et al., Proc. Natl. Acad. Sci. USA 95, 1 1987-1 1992, 1998). 

25 The functional and anatomical dichotomy between the main and 

vomeronasal (or accessory) olfactory systems is further reflected at the level of the 
molecules that serve as receptors, or putative receptors, for their respective sensory 
stimuli. In the main olfactory system, odorant receptor genes encode seven- 
transmembrane proteins and are members of a multigene family that may comprise as 

30 many as 1000 genes in rat and mouse (Buck and Axel. Cell 65:175-187, 1991). In the 
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VNO, two families of genes encoding seven-transmembrane proteins have been 
proposed to encode pheromone receptors. The first family of vomeronasal receptor 
genes consists of 30-100 genes that are expressed selectively in vomeronasal sensory 
neurons of the apical zone of the epithelium of the VNO (Dulac and Axel, supra). 

5 The second family of vomeronasal receptor genes comprises 30-140 genes that are 
expressed in vomeronasal sensory neurons of the basal zone (Herrada et ai, supra). 
There are no conserved motifs between the two families of vomeronasal receptors, 
and vomeronasal receptors have no sequence homology with odorant receptors. 
Neurons in the apical and basal zones express G protein subunits, respectively, 

10 and G Q0 , and project their axons to distinct regions in the mouse accessory olfactory 
bulb (Berghard and Buck, J. Neurosci. 16:909-918, 1996). The existence of 
segregated fibers and corresponding G proteins suggests that distinct pheromone 
signals are likely to elicit electrical stimulation of restricted populations of VNO 
sensory neurons in order to generate distinct behavioral responses (Herrada et ai, 

15 supra). 

It is not known whether G proteins are involved in signal transduction 
of pheromonal stimuli; they serve as useful molecular markers whose expression 
tends to correlate with that of the two families of vomeronasal receptor genes. The 
vomeronasal receptor genes encode putative pheromone receptors; there is no 

20 evidence that any of these molecules is a receptor for a pheromone. Because few 
mammalian pheromones have been identified at the molecular level, ligand-receptor 
interactions are difficult to define (Sorensen, supra). 

Human menstrual studies provide strong evidence of the existence of 
human "pheromones". Studies of women living together in college dormitories 

25 report significant increase in synchronization, i.e., a decrease in the difference 
between onset dates, among roommates and close friends (McClintock, Nature 
229:244-245, 1 971). In the late follicular phase of women's menstrual cycles, 
odorless compounds from women's armpits accelerated the preovulatory surge of 
luteinizing hormone of recipient women and shortened their menstrual cycles. 

30 Odorless compounds from the same donors collected later in the menstrual cycle, i.e. , 
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at ovulation, delayed the luteinizing-hormone surge of the recipients and lengthened 
their menstrual cycles (Stern and McClintock, Nature 392:177-179, 1998). These 
odorless compounds are postulated as possible candidates for human pheromones. 

Recent studies show evidence supportive of a functional vomeronasal 

5 system in humans. During prenatal development, the VNO appears distinctly in the 
submucosa of the septal wall. The fully developed organ has a multistratified 
epithelium around a narrow lumen (Kjaer and Fischer Hansen, Eur. J. Oral Sci. 
104:34-40, 1996). Clinical examinations revealed paired bilateral vomeronasal pits on 
the anterior third of the nasal septum. The vomeronasal pit leads to a closed tube, 

10 lined by a unique pseudostratified columnar epithelium with short microvilli (Morgan 
et a/., J. Steroid Biochem. Molec. Biol. 39:545-552, 1991). Calbindin-like 
immunoreactivity has been found in epithelial cells of the newborn and adult human 
vomeronasal organ (Johnson et a/., Brain Research 638:329-333, 1994). Some studies 
suggest that adult human VNO may display species-specific, gender-dimorphic and 

15 stereospecific responses to vomeropherin ligands (Monti -Bloch et al, 
Psychoneuroendocrinology 19(5-7):673-386, 1994). 

In U.S. Pat. No. 5,668,006, G-protein linked receptors are reported to 
control many physiological functions, such as mediating transmembrane signaling 
from external stimuli (vision, taste and smell), endocrine function (pituitary and 

20 adrenal), exocrine fiinction (pancreas), heart rate, lipolysis, and carbohydrate 

metabolism. The molecular cloning of a number of such receptors have revealed 
many structural and genetic similarities, permitting classification of the G protein- 
linked receptor superfamily into five distinct groups. 

U.S. Pat. No. 5,691,188, describes how upon binding to the receptor, 

25 the receptor presumably undergoes a conformation change leading to activation of the 
G protein. G proteins are described as being comprised of three subunits: a guanyl- 
nucleotide binding a subunit; a p subunit; and a y subunit. G proteins cycle between 
two forms, depending on whether GDP or GTP is bound to the a subunit. When GDP 
is bound, the G protein exists as a heterotrimer, the Gapy complex. When GTP is 

30 bound, the a subunit dissociates, leaving a GPy complex. When a Gapy complex 
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operatively associates with an activated G protein coupled receptor in a cell 
membrane, the rate of exchange of GTP for bound GDP is increased and the rate of 
dissociation of the bound Get subunit from the Gapy complex increases. The free Ga 
subunit and GPy complex are capable of transmitting a signal to downstream elements 

5 of a variety of signal transduction pathways. This fundamental scheme of events 
forms the basis for a multiplicity of different cell signaling phenomena. 

Despite the evidence of a human vomeronasal organ, and the 
characterization of putative rodent pheromone receptors, to date there is no evidence 
of a functional human vomeronasal-like receptor. Only human pseudogene 

10 homologous to a rat gene encoding a vomeronasal receptor has been reported (Duke 
and Axel, Cell, 83:195-206, 1995). The present inventors efforts at cloning such a 
receptor also resulted in identification of numerous pseudogenes. Thus, there is a 
need in the art to isolate and characterize the structure of a human vomeronasal 
receptor. 

15 There is a further need to study ligand binding to and activation of such 

a receptor and to screen for agonists and antagonists. 

There is also a need to determine whether such a receptor mediates 
pheromone receptor activity. 

20 SUMMARY OF THE INVENTION 

In one embodiment, the invention provides an isolated human, 
vomeronasal-like receptor that is homologous to rat and mouse vomeronasal receptors 
at the amino acid level. The human vomeronasal-like receptor shares homology 
features found in a large number of functional rodent vomeronasal receptors that are 

25 putative pheromone receptors but, unlike earlier identified pseudogenes, this protein is 
encoded by a gene that is not disrupted. Furthermore, the genomic gene for the 
hVLRl lacks introns in the coding region. In specific embodiments, the invention 
provides allelic variants, splice variants, and alternative start-site variants of the 
human vomeronasal-like receptor. 

30 In a further embodiment, the invention provides an isolated nucleic 
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acid comprising a sequence that encodes a functional human, vomeronasal-like 
receptor. The human embodiment of this nucleic acid is free of 3' and 5' non-coding 
or non-transcribed genomic sequences. This nucleic acid can be provided as a cDNA 
or a genome sequence, or alternatively joined to a heterologous nucleic acid, such as 

5 an expression vector. Genomic hVLRl comprises three exons, with the coding region 
found in the last exon, operably associated with an endogenous promoter. 

In yet a further embodiment, the invention provides an isolated 
chimeric polypeptide comprising an amino acid sequence of a human vomeronasal- 
like receptor fused to a heterologous amino acid sequence, such as a signal peptide, an 

10 antibody tag, an expression tag, a chromatographic tag, a cytoplasmic signal domain, 
and a G-protein binding domain. 

In yet another embodiment, the invention provides an isolated 
antigenic fragment of the human vomeronasal-like receptor, as well as an antibody 
that specifically binds to the receptor. 

15 in another embodiment, the invention provides PCR primers, 

antisense, ribozyme nucleic acids, and vectors for isolation, cloning, and screening for 

thehVLRl receptor. 

In a further embodiment, the invention provides a method for isolating, 
expressing, and screening with the human vomeronasal-like receptor. 
20 In a yet further embodiment, the invention provides for identifying an 

allelic variant of a gene encoding a human vomeronasal-like receptor. In a specific 
embodiment, the polymorphism is detected by sequencing. 

Still another embodiment of the invention concerns the identification 
of a primate (chimpanzee) vomeronasal-like receptor gene, the protein encoded 
25 thereby, related products, and screens based thereon. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. Nucleotide and deduced amino acid sequence of "short" 
human vomeronasal-like receptor, hVLRl (SEQ ID NO:l and 2, respectively). 
30 Figures 2a and 2b. Nucleic acid and deduced amino acid sequence of 
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"long" isoform of the functional vomeronasal-like receptor hVLRl (SEQ ID NO:3 
and 4, respectively). 

Figures 3a-3d. Amino acid sequence comparison of human 
vomeronasal-like receptor hVLRl with mouse pheromone receptor (emb\CAA73256) 
5 (SEQ ID NO:19) and rat VN6 pheromone receptor (pir\I61748) (SEQ ID NO:20). 

Figure 4. Comparison of mouse/rat vomeronasal receptor "consensus" 
and human vomeronasal-like receptor pseudogenes and BH33 (hVLRl). 

Figure 5 A. Southern blot of human DNA digested with EcoRJ (RI) or 
HindIII(H) hybridized with a probe specific for the VLR1 coding sequence. B. The 
10 genomic structure of V 1 RL1 . 

Figure 6. Schematic of swapping hVLRl into the mouse VR2 locus. 

Figures 7a and 7b. Alignment of the amino acid sequences of two 
human VRL1 variants (a and b) (SEQ ID NO: 15 and 16, respectively) with those of 
the putative chimpanzee ortholog (cVlRLl) (SEQ ID NO:17) and two mouse V1R 
1 5 sequences, mVR23 (SEQ ED NO: 1 8) and mpr2 (SEQ ID NO: 1 9). 

Figure 8. Blot analysis of RT-PCR products hybridized with a VRL1 

probe. 

nFTATl .F.D DESCRIPTION OF TH E INVENTION 

20 For the first time, functional cDNA clones encoding a hVLRl receptor 

have been isolated and their expression characterized (Figures 1 and 2). Eight 
different sequences, Bh33, hVNOl, h32, bh21hp5, bholhp5, h22axh, h23axbglll, 
h24axb, were found to have strong homologies with the coding regions of mouse or 
rat vomeronasal receptor genes. One functional gene sequence was isolated based on 

25 its homology to the mouse and rat orthologues, particularly at highly conserved 
positions (consensus domains) (Figure 3). The other seven of these are heavily 
mutated with multiple frameshifts and stop codons in the coding sequences, indicating 
that they are pseudogenes in humans (Figure 4). In particular, Bh33 sequence has a 
complete open reading frame and potentially encodes a protein similar to the mouse or 

30 rat vomeronasal receptors from the VR1 family. Many of the conserved amino acids 
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in rat and mouse vomeronasal receptors are also conserved in Bh33. This discovery is 
particularly surprising and unexpected since only human pseudogenes have been 
found to date that bear homology to rat and mouse vomeronasal receptors. 

In addition, the present invention provides an N-terminal trunctated 

5 splice variant of hVLRl , as depicted in Figure 5B and detected in olfactory 

epithelium. Southern blot analysis of human genomic DNA with the VLR1 probe 
reveals a single band (Figure 5 A), for both human DNA digested with EcoRI (RI) or 
Hindlll (H), indicating that the VRL1 gene is not part of a human multigene 
subfamily of closely related proteins. 

10 A sequence corresponding to Bh33 can be found in GenBank (AC 

004076) embedded within a large sequence, namely, a 500 kb ZNF gene family of 
human chromosome 19, cosmid R30217. The GenBank sequence tentatively 
identified a partial open reading frame region that is similar to a rat pheromone 
receptor VN6 (u36898), with 27% identity at the deduced amino acid sequence level. 

15 However, the deduced amino acid sequence is not provided, so there is no basis for 
concluding from this information that the annotated region encodes anything other 
than a pseudogene. 

The term "hVLRl" or "V1RL1" as used herein refers to a primate, 
preferably human (RVLR1) form of the vomeronasal-like receptor. Such a receptor is 

20 characterized by one or more of the following distinct features: 

(a) it has two alternative start-site isoforms, a "short" form, which 
is about a 313 amino acid polypeptide bearing close homology to mouse and rat 
vomeronasal receptors, and a "long" form, which is about a 353 amino acid 
polypeptide, both with seven putative transmembrane domains; and an alternative 

25 splice form yielding an N-terminal truncated product of about 243 amino acids 

sharing homology with vomeronasal receptors on the last two-thirds of its amino acid 
sequence; 

(b) it is encoded by a cDNA of about 942 nucleotides ("short" 
form) or 1059 nucleotides ("long" form); 

30 (c) the amino acid sequence of the short form (and the long form. 
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over the corresponding region, i.e., excluding the first 40 amino acid residues) is 28% 
identical and 47% similar to the mouse pheromone receptor (emb\CAA73256) by 
BLASTX analysis. 

In a specific embodiment, the hVLRl has an amino acid sequence as 

5 shown in SEQ ID NO:2 or 4, or any 1 0 amino acid portion thereof In another 

embodiment, the hVLRl is an allelic variant having the sequence as shown in SEQ ID 
NO:2, with one or two amino acid differences: Ser201 to Phe; Ala229 to Asp; or 
both, or any 10 amino acid portion thereof Corresponding "long" form allelic 
variants are also contemplated. A VLR1 of the invention will have at least about 90% 

10 amino acid sequence identity with SEQ ID NO:2 or 4, preferably about 95% identity, 
and may have 99% or greater identity, as exemplified by the allelic variants described 
here. In specific embodiments, chimp VLRl(cVLRl) has 93% amino acid sequence 
identity with VIRLla (SEQ ID NO: 15)and V1RL1 b (SEQ ID NO: 16) and human 
polymorphic variants have 99% identity. 

15 BLAST (Basic Local Alignment Search Tool) program was used to 

search for homology. Specifically, the search algorithm, BLASTX, was used 
according to Altschul, et. al., "Gapped BLAST and PSI-BLAST: a new generation of 
protein database search programs", Nucleic Acids Res, 1990, 25:3389-3402, to 
compare the six-frame conceptual translation products of the hVLRl 942 nucleotide 

20 query sequence (both strands) against a protein sequence database. The fundamental 
unit of BLAST algorithm output is the High-scoring Segment Pair (HSP). An HSP 
consists of two sequence fragments of arbitrary but equal length whose alignment is 
locally maximal and for which the alignment score meets or exceeds a threshold or 
cutoff score. A set of HSPs is thus defined by two sequences, a scoring system, and a 

25 cutoff score; this set may be empty if the cutoff score is sufficiently high. In the 
programmatic implementations of the BLAST, each HSP consists of a segment from 
the query sequence and one from a database sequence (Altschul et al., supra). 

The approach to similarity searching taken by the BLAST programs is 
first to look for similar segments (HSPs) between the query sequence and a database 

30 sequence, then to evaluate the statistical significance of any matches that were found. 
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and finally to report only those matches that satisfy a user-selectable threshold pf 
significance (E parameter). The Expect value (E) is a parameter that describes the 
number of hits one can "expect" to see just by chance when searching a database of a 
particular size. It decreases exponentially with the Score (S) that is assigned to a 

5 match between two sequences. Essentially, the E value describes the random 

background noise that exists for matches between sequences. The value is used as a 
way to create significance threshold for reporting results. When the Expect value is 
increased from the default value of 10, a larger list with more low-scoring hits can be 
reported (Altschul et aL, supra). 

0 Table 1 illustrates the BLASTX results for the hVLRl shown in SEQ 

ID NO:2. hVLRl can be characterized by the relationships shown in this table: 
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Table 1 - Bh33 BLASTX search results. 



Sequence 
Identifier 


Sequence Description 


% 

Identity 


% 

Similarity 


Scor 
e 

HSP 


E 

value 


emb\CAA732 
56 


mouse pheromone* 
receptor 2 (Y12724) [Mus 
musculus] 


28 


47 


104 


7e-22 


pir\A57223 


rat VN3 pheromone 
receptor [1055248 
(U36895)] 


28 


45 


100 


2e-20 


pir\I61748 


rat VN6 pheromone 
receptor [1055254 
(U36898)] 


28 


44 


98 


7e-20 


pir\I61746 


rat VN4 pheromone 
receptor [1055250 
(U36896)] 


27 


45 


96 


4e-19 


emb\CAA732 
57 


mouse pheromone 
receptor 1 (Y12725) [Mus 
musculus] 


27 


45 


90 


3e-17 


pir\I61749 


rat VN2 pheromone 
receptor [1055256 
(U36899)] 


27 


44 


89 


6e-17 


gi\l 039470 


VN1 pheromone receptor 
(U36785) [Rattus 
norvegicus] 


26 


43 


85 


7e-16 


gi\l 039472 


VN7 pheromone receptor 
(U36786) [Rattus 
norvegicus] 


25 


44 


83 


2e-15 


pir\I61747 


rat VN5 pheromone 
receptor [1055252 
(U36897)] 


28 


46 


80 


2e-14 


emb\CAA770 
91 


mouse thyrotropin 
releasing hormone 
receptor (Y 18244) 


29 


48 


42 


0.005 



♦Although not so designated in the database, all of the receptors in this table are 
putative pheromone receptor. 
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hVLRl can also be characterized by being functional, i.e., it is 
expressed as a protein. Functionality of hVLRl of the invention includes binding 
pheromone-like substrate, and pheromone-like antagonist, which can be affected by 
GTP; G-protein binding; and signal transduction in response to binding vomeropherin 

5 or a vomeropherin agonist. Signal transduction may be evaluated by intracellular 
calcium mobilization, cyclic AMP accumulation, activation of other G-protein 
coupled signal transduction pathways, reporter gene expression coupled to G-protein 
signal transduction, and other methods. 

Various chimeric constructs comprising hVLRl are contemplated as 

10 well. Such constructs comprise an hVLRl fused to a heterologous amino acid 

sequence, e.g., having functional activity. For example, the hVLRl can be tagged 
with an N-terminal or C-terminal tag, such as Myc or FLAG, for 
immuno-precipitation. Alternatively, a signal sequence can be substituted for the 
endogenous signal sequence for more efficient processing into the rough endoplasmic 

15 reticulum, golgi, and cell membrane. Alternatively, an expression tag, such as an 

a-mating factor sequence for yeast expression, or residual amino acid residues from a 
recombinant construct, may be present. In yet another embodiment, a 
chromatographic tag or handle can be joined to hVLRl . For example, a polyhistidine 
sequence permits purification on a nickle chelation column. Various combinations of 

20 hVLRl and mouse or rat VNR segments yield alternative chimeric forms of the 
receptor. Other chimeric constructs, in which heterologous signal transduction 
domains or G-protein-binding domains are incorporated in the protein are discussed in 
greater detail, infra. 

Thus, the present invention advantageously provides a nucleic acid 

25 encoding a human vomeronasal-like receptor (hVLRl), the polypeptide encoded by 
this nucleic acid, cells stably expressing hVLRl, and methods for using such cells, 
e.g.. to screen for hVLRl agonists and antagonists, particularly agonists and 
antagonists that are selective for a vomeronasal-like receptor. 

In a specific embodiment, the nucleotide sequences encoding the 

30 amino acids comprising the novel receptor protein are depicted in SEQ ID NO: 1 or 
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SEQ ID NO:3 for hVLRl, or allelic variants thereof as described above. 

Receptors expressed from novel hVLRl DNAs may be expressed in 
eukaryotic and prokaryotic cells and can be used to develop and/or implement high 
throughput screens to identify novel pheromone-like agonists and antagonists. These 

5 novel DNAs may be used to help identify receptor subtype selective ligands and may 
be used to make chimeric and mutant vomeronasal-like receptors which can be used 
to identify critical ligand binding domains as well as to determine selectivity of 
ligands. These novel DNAs can be used to further investigate signal transduction 
systems of vomeronasal-like receptors as well as to determine tissue distribution of 

10 receptors. 

Translation of hVLRl cDNAs results in protein sequences which 
display many of the characteristics of G protein coupled receptors. The peptide 
sequences of these novel cDNAs may be used to generate antibodies. Antibodies to 
the receptor can be used to activate the receptor, e.g., by aggregating them. 

15 In a specific embodiment, the present invention provides a vector 

adapted for expression in a mammalian cell, which comprises the cDNA encoding the 
functional hVLRl. The term "adapted for expression in a mammalian cell" means that 
the regulatory elements necessary for the expression of the cDNA in the mammalian 
cell are present on the plasmid. 

20 The invention further provides a cDN A probe useful for detecting 

nucleic acid encoding the hVLRl receptor comprising a nucleic acid molecule of at 
least about 20 nucleotides having a sequence complementary to a sequence included 
within the sequence shown in SEQ ID NO: 1 or SEQ ID NO:3. It also provides 
antisense or triple-helix-forming oligonucleotides capable of suppressing expression 

25 ofhVLRl. 

In a specific embodiment, the term "about" or "approximately" means 
within 20%, preferably within 10%, and more preferably within 5% of a given value 
or range. Alternatively, and particularly with respect to biological responses, the term 
"about" means within an order of magnitude, preferably a factor of 5, and more 
30 preferably a factor of 2 of a give value. 



WO 01/25431 



PCT/US00/27211 



-14- 

As used herein, the term "isolated" means that the referenced material 
is free of components found in the natural environment in which the material is 
normally found or that the referenced material is present in a heterologous 
environment. In particular, isolated biological material is free of cellular components. 

5 In the case of nucleic acid molecules, an isolated nucleic acid includes a PCR product, 
an isolated mRNA, a cDNA, or a restriction fragment. In another embodiment, an 
isolated nucleic acid is preferably excised from the chromosome in which it may be 
found, and more preferably is no longer joined to non-regulatory, non-coding regions, 
or to other genes, located upstream or downstream of the gene contained by the 

10 isolated nucleic acid molecule when found in the chromosome. In yet another 

embodiment, the isolated nucleic acid lacks one or more introns. Isolated nucleic acid 
molecules can be inserted into plasmids, cosmids, artificial chromosomes, and the 
like. Thus, in a specific embodiment, a cloned or recombinant nucleic acid is an 
isolated nucleic acid. An isolated protein may be associated with other proteins or 

15 nucleic acids, or both, with which it associates in the cell, or with cellular membranes 
if it is a membrane-associated protein. An isolated organelle, cell, or tissue is removed 
from the anatomical site in which it is found in an organism. An isolated material 
may be, but need not be, purified. 

The term "purified" as used herein refers to material that has been 

20 isolated under conditions that reduce or eliminate unrelated materials, i.e., 

contaminants. For example, a purified protein is preferably substantially free of other 
proteins or nucleic acids with which it is associated in a cell; a purified nucleic acid 
molecule is preferably substantially free of proteins or other unrelated nucleic acid 
molecules with which it can be found within a cell. As used herein, the term 

25 "substantially free" is used operationally, in the context of analytical testing of the 
material. Preferably, purified material substantially free of contaminants is at least 
50% pure; more preferably, at least 90% pure, and more preferably still at least 99% 
pure. Purity can be evaluated by chromatography, gel electrophoresis, immunoassay, 
composition analysis, biological assay, and other methods known in the art. 

30 The use of italics indicates a nucleic acid molecule (e.g. , hVLRl , 
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cDNA, gene, etc.); normal text indicates the polypeptide or protein. 

In accordance with the present invention there may be employed 
conventional molecular biology, microbiology, and recombinant DNA techniques 
within the skill of the art. Such techniques are explained fully in the literature. See, 
5 e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, 
Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York (herein "Sambrook et ai, 1989"); DNA Cloning: A Practical Approach 
Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 
1984); Nucleic Acid Hybridization [B.D. Hames & S.J. Higgins eds. (1985)]; 
10 Transcription And Translation [B.D. Hames & S.J. Higgins, eds. (1 984)]; Animal Cell 
Culture [R.I. Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, 
(1986)]; B.EPerbal, A Practical Guide To Molecular Cloning (1984); F.M. Ausubel et 
ah (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

Molecular Biology - Definitions 
15 "Amplification" of DNA as used herein denotes the use of polymerase 

chain reaction (PCR) to increase the concentration of a particular DNA sequence 
within a mixture of DNA sequences. For a description of PCR see Saiki et al. 9 
Science, 239:487, 1988. 

"Chemical sequencing" of DNA denotes methods such as that of 
20 Maxam and Gilbert (Maxam-Gilbert sequencing, Maxam and Gilbert, Proc. Natl. 

Acad. Sci. USA, 74:560, 1977), in which DNA is randomly cleaved using individual 
base-specific reactions. 

"Enzymatic sequencing" of DNA denotes methods such as that of 
Sanger (Sanger et al % \977, Proc. Natl. Acad. Sci. USA, 74:5463, 1977), in which a 
25 single-stranded DNA is copied and randomly terminated using DNA polymerase, 
including variations thereof well-known in the art. 

As used herein, "sequence-specific oligonucleotides" refers to related 
sets of oligonucleotides that can be used to detect allelic variations or mutations in the 
hVLRl gene. 

30 A "nucleic acid molecule" refers to the phosphate ester polymeric form 
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of ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as 
phosphorothioates and thioesters, in either single stranded form, or a double-stranded 

5 helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. 
The term nucleic acid molecule, and in particular DNA or RNA molecule, refers only 
to the primary and secondary structure of the molecule, and does not limit it to any 
particular tertiary forms. Thus, this term includes double-stranded DNA found, inter 
alia, in linear (e.g., restriction fragments) or circular DNA molecules, plasmids, and 

10 chromosomes. In discussing the structure of particular double-stranded DNA 

molecules, sequences may be described herein according to the normal convention of 
giving only the sequence in the 5' to 3' direction along the nontranscribed strand of 
DNA (i.e., the strand having a sequence homologous to the mRNA). A "recombinant 
DNA molecule" is a DNA molecule that has undergone a molecular biological 

15 manipulation. 

A "polynucleotide" or "nucleotide sequence" is a series of nucleotide 
bases (also called "nucleotides") in DNA and RNA, and means any chain of two or 
more nucleotides. A nucleotide sequence typically carries genetic information, 
including the information used by cellular machinery to make proteins and enzymes. 

20 These terms include double or single stranded genomic and cDNA, RNA, any 

synthetic and genetically manipulated polynucleotide, and both sense and anti-sense 
polynucleotide (although only sense stands are being represented herein). This 
includes single- and double-stranded molecules, i.e., DNA-DNA, DNA-RNA and 
RNA-RNA hybrids, as well as "protein nucleic acids" (PNA) formed by conjugating 

25 bases to an amino acid backbone. This also includes nucleic acids containing 
modified bases, for example thio-uracil. thio-guanine and fluoro-uracil. 

The polynucleotides herein may be flanked by natural regulatory 
(expression control) sequences, or may be associated with heterologous sequences, 
including promoters, internal ribosome entry sites (IRES) and other ribosome binding 

30 site sequences, enhancers, response elements, suppressors, signal sequences. 
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polyadenylation sequences, introns, 5 - and 3'- non-coding regions, and the like. The 
nucleic acids may also be modified by many means known in the art. Non-limiting 
examples of such modifications include methylation, "caps", substitution of one or 
more of the naturally occurring nucleotides with an analog, and internucleotide 

5 modifications such as, for example, those with uncharged linkages (e.g. , methyl 
phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Polynucleotides 
may contain one or more additional covalently linked moieties, such as, for example, 
proteins (e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), 

10 intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., metals, radioactive metals, 
iron, oxidative metals, etc.), and alkylators. The polynucleotides may be derivatized 
by formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate 
linkage. Furthermore, the polynucleotides herein may also be modified with a label 
capable of providing a detectable signal, either directly or indirectly. Exemplary 

15 labels include radioisotopes, fluorescent molecules, biotin, and the like. 

A "polymorphism" as used herein denotes a variation in the nucleotide 
sequence of a gene in an individual. Genes that have different nucleotide sequences 
as a result of a polymorphism are "alleles." A "polymorphic position" is a 
predetermined nucleotide position within the sequence. In some cases, genetic 

20 polymorphisms are reflected by an amino acid sequence variation, and thus a 

polymorphic position can result in location of a polymorphism in the amino acid 
sequence at a predetermined position in the sequence of a polypeptide. An individual 
"homozygous" for a particular polymorphism is one in which both copies of the gene 
contain the same sequence at the polymorphic position. An individual "heterozygous" 

25 for a particular polymorphism is one in which the two copies of the gene contain 
different sequences at the polymorphic position. 

A "polymorphism pattern" as used herein denotes a set of one or more 
polymorphisms, including without limitation single nucleotide polymorphisms, which 
may be contained in the sequence of a single gene or a plurality of genes. In the 

30 simplest case, a polymorphism pattern can consist of a single nucleotide 
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polymorphism in only one position of one of two alleles of an individual . However, 
one has to look at both copies of a gene. A polymorphism pattern that is appropriate 
for assessing a particular aspect of cardiovascular status (e.g., predisposition to 
hypertension) need not contain the same number (nor identity, of course) of 

5 polymorphisms as a polymorphism pattern that would be appropriate for assessing 
another aspect of cardiovascular status (e.g., responsivity to ACE inhibitors for 
control of hypertension). A "test polymorphism pattern" as used herein is a 
polymorphism pattern determined for a human subject of undefined cardiovascular 
status. A "reference polymorphism pattern" as used herein is determined from a 

1 o statistically significant correlation of patterns in a population of individuals with pre- 
determined cardiovascular status. 

The term "host cell" means any cell of any organism that is selected, 
modified, transformed, grown, or used or manipulated in any way, for the production 
of a substance by the cell, for example the expression by the cell of a gene, a DNA or 

15 RNA sequence, a protein or an enzyme. Host cells can further be used for screening 
or other assays, as described infra. 

Proteins and enzymes are made in the host cell using instructions in 
DNA and RNA, according to the genetic code. Generally, a DNA sequence having 
instructions for a particular protein or enzyme is "transcribed" into a corresponding 

20 sequence of RNA. The RNA sequence in turn is "translated" into the sequence of 
amino acids which form the protein or enzyme. An "amino acid sequence" is any 
chain of two or more amino acids. Each amino acid is represented in DNA or RNA 
by one or more triplets of nucleotides. Each triplet forms a codon, corresponding to 
an amino acid. For example, the amino acid lysine (Lys) can be coded by the 

25 nucleotide triplet or codon AAA or by the codon A AG. (The genetic code has some 
redundancy, also called degeneracy, meaning that most amino acids have more than 
one corresponding codon.) Because the nucleotides in DNA and RNA sequences are 
read in groups of three for protein production, it is important to begin reading, the 
sequence at the correct amino acid, so that the correct triplets are read. The way that a 

30 nucleotide sequence is grouped into codons is called the "reading frame." 
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A "coding sequence" or a sequence "encoding" an expression product, 
such as a RNA, polypeptide, protein, or enzyme, is a nucleotide sequence that, when 
expressed, results in the production of that RNA, polypeptide, protein, or enzyme, i.e.. 
the nucleotide sequence encodes an amino acid sequence for that polypeptide, protein 
5 or enzyme. A coding sequence for a protein may include a start codon (usually ATG) 
and a stop codon. 

The term "gene", also called a "structural gene" means a DNA 
sequence that codes for or corresponds to a particular sequence of amino acids which 
comprise all or part of one or more proteins or enzymes, and may or may not include 

10 regulatory DNA sequences, such as promoter sequences, which determine for example 
the conditions under which the gene is expressed. Some genes, which are not 
structural genes, may be transcribed from DNA to RNA, but are not translated into an 
amino acid sequence. Other genes may function as regulators of structural genes or as 
regulators of DNA transcription. 

15 A "promoter sequence" is a DNA regulatory region capable of binding 

RNA polymerase in a cell and initiating transcription of a downstream (3 1 direction) 
coding sequence. For purposes of defining the present invention, the promoter 
sequence is bounded at its 3' terminus by the transcription initiation site and extends 
upstream (5' direction) to include the minimum number of bases or elements 

20 necessary to initiate transcription at levels detectable above background. Within the 
promoter sequence will be found a transcription initiation site (conveniently defined 
for example, by mapping with nuclease SI), as well as protein binding domains 
(consensus sequences) responsible for the binding of RNA polymerase. 

A coding sequence is "under the control" or "operatively associated 

25 with" of transcriptional and translational control sequences in a cell when RNA 
polymerase transcribes the coding sequence into mRNA, which is then trans-RNA 
spliced (if it contains introns) and translated into the protein encoded by the coding 
sequence. 

The terms "express" and "expression" mean allowing or causing the 
30 information in a gene or DNA sequence to become manifest, for example producing 
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mRNA or a protein by activating the cellular functions involved in transcription and 
translation of a corresponding gene or DNA sequence. A DNA sequence is expressed 
in or by a cell to form an "expression product" such as an mRNA or a protein. The 
expression product itself, e.g. the resulting mRNA or protein, may also be said to be 

5 "expressed" by the cell. A protein expression product can be characterized as 

intracellular, membrane, or secreted. The term "intracellular" means something that is 
inside a cell. The term "membrane" means something that is in the cell membrane. A 
substance is "secreted" by a cell if it appears in significant measure outside the cell, 
from somewhere on or inside the cell. 

10 The term "transfection" means the introduction of a foreign nucleic 

acid into a cell. The term "transformation" means the introduction of a "foreign" (i.e. 
extrinsic or extracellular) gene, DNA or RNA sequence to a host cell, so that the host 
cell will express the introduced gene or sequence to produce a desired substance, 
typically a protein or enzyme coded by the introduced gene or sequence. The 

15 introduced gene or sequence may also be called a "cloned" or "foreign" gene or 

sequence, may include regulatory or control sequences, such as start, stop, promoter, 
signal, secretion, or other sequences used by a cell's genetic machinery. The gene or 
sequence may include nonfunctional sequences or sequences with no known function. 
A host cell that receives and expresses introduced DNA or RNA has been 

20 "transformed" and is a "transformant" or a "clone." The DNA or RNA introduced to a 
host cell can come from any source, including cells of the same genus or species as 
the host cell, or cells of a different genus or species. 

The terms "vector", "cloning vector" and "expression vector" mean the 
vehicle by which a DNA or RNA sequence (e.g. a foreign gene) can be introduced 

25 into a host cell, so as to transform the host and promote expression (e.g. transcription 
and translation) of the introduced sequence. Vectors include plasmids, phages, 
viruses, etc.; they are discussed in greater detail below. 

Vectors typically comprise the DNA of a transmissible agent, into 
which foreign DNA is inserted. A common way to insert one segment of DNA into 

30 another segment of DNA involves the use of enzymes called restriction enzymes that 
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cleave DNA at specific sites (specific groups of nucleotides) called restriction sites. A 
"cassette" refers to a DNA coding sequence or segment of DNA that codes for an 
expression product that can be inserted into a vector at defined restriction sites. The 
cassette restriction sites are designed to ensure insertion of the cassette in the proper 
5 reading frame. Generally, foreign DNA is inserted at one or more restriction sites of 
the vector DNA, and then is carried by the vector into a host cell along with the 
transmissible vector DNA. A segment or sequence of DNA having inserted or added 
DNA, such as an expression vector, can also be called a "DNA construct." A 
common type of vector is a "plasmid", which generally is a self-contained molecule 

10 of double-stranded DNA, usually of bacterial origin, that can readily accept additional 
(foreign) DNA and which can readily introduced into a suitable host cell. A plasmid 
vector often contains coding DNA and promoter DNA and has one or more restriction 
sites suitable for inserting foreign DNA. Coding DNA is a DNA sequence that 
encodes a particular amino acid sequence for a particular protein or enzyme. 

15 Promoter DNA is a DNA sequence which initiates, regulates, or otherwise mediates or 
controls the expression of the coding DNA. Promoter DNA and coding DNA may be 
from the same gene or from different genes, and may be from the same or different 
organisms. A large number of vectors, including plasmid and fungal vectors, have 
■ been described for replication and/or expression in a variety of eukaryotic and 

20 prokaryotic hosts. Non-limiting examples include pKK plasmids (Clonetech), pUC 
plasmids, pET plasmids (Novagen, Inc., Madison, WI), pRSET or pREP plasmids 
(Invitrogen, San Diego, CA), or pMAL plasmids (New England Biolabs, Beverly, 
MA), and many appropriate host cells, using methods disclosed or cited herein or 
otherwise known to those skilled in the relevant art. Recombinant cloning vectors 

25 will often include one or more replication systems for cloning or expression, one or 
more markers for selection in the host, e.g. antibiotic resistance, and one or more 
expression cassettes. 

The term "expression system" means a host cell and compatible vector 
under suitable conditions, e.g. for the expression of a protein coded for by foreign 

30 DNA carried by the vector and introduced to the host cell. Common expression 
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systems include E. coli host cells and plasmid vectors, insect host cells and 
Baculovirus vectors, and mammalian host cells and vectors. hVLRl may be 
expressed in PC 12, COS-1, or C 2 C 12 cells. Other suitable cells include CHO cells, 
HeLa cells, 293T (human kidney cells), mouse primary myoblasts, and NIH 3T3 cells. 

5 The term "heterologous" refers to a combination of elements not 

naturally occurring. For example, heterologous DNA refers to DNA not naturally 
located in the cell, or in a chromosomal site of the cell. Preferably, the heterologous 
DNA includes a gene foreign to the cell. A heterologous expression regulatory 
element is a such an element operatively associated with a different gene than the one 

10 it is operatively associated with in nature. In the context of the present invention, an 
hVLRl gene is heterologous to the vector DNA in which it is inserted for cloning or 
expression, and it is heterologous to a host cell containing such a vector, in which it is 
expressed, e.g., a CHO cell. 

The terms "mutant" and "mutation" mean any detectable change in 

15 genetic material, e.g. DNA, or any process, mechanism, or result of such a change. 
This includes gene mutations, in which the structure {e.g. DNA sequence) of a gene is 
altered, any gene or DNA arising from any mutation process, and any expression 
product (e.g. protein or enzyme) expressed by a modified gene or DNA sequence. 
The term "variant" may also be used to indicate a modified or altered gene, DNA 

20 sequence, enzyme, cell, etc., i.e., any kind of mutant. 

"Sequence-conservative variants" of a polynucleotide sequence are 
those in which a change of one or more nucleotides in a given codon position results 
in no alteration in the amino acid encoded at that position. 

"Function-conservative variants" are those in which a given amino acid 

25 residue in a protein or enzyme has been changed without altering the overall 
conformation and function of the polypeptide, including, but not limited to, 
replacement of an amino acid with one haying similar properties (such as, for 
example, polarity, hydrogen bonding potential, acidic, basic, hydrophobic, aromatic, 
and the like). Amino acids with similar properties are well known in the art. For 

30 example, arginine, histidine and lysine are hydrophilic-basic amino acids and may be 
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interchangeable. Similarly, isoleucine, a hydrophobic amino acid, may be replaced 
with leucine, methionine or valine. Such changes are expected to have little or no 
effect on the apparent molecular weight or isoelectric point of the protein or 
polypeptide. Amino acids other than those indicated as conserved may differ in a 

5 protein or enzyme so that the percent protein or amino acid sequence similarity 

between any two proteins of similar function may vary and may be, for example, from 
70% to 99% as determined according to an alignment scheme such as by the Cluster 
Method, wherein similarity is based on the MEGALIGN algorithm. A 
"function-conservative variant" also includes a polypeptide or enzyme which has at 

10 least 60 % amino acid identity as determined by BLAST or FASTA algorithms, 
preferably at least 75%, most preferably at least 85%, and even more preferably at 
least 90%, and which has the same or substantially similar properties or functions as 
the native or parent protein or enzyme to which it is compared. 

As used herein, the term "homologous" in all its grammatical forms 

15 and spelling variations refers to the relationship between proteins that possess a 
"common evolutionary origin," including proteins from superfamilies (e.g., the 
immunoglobulin superfamily) and homologous proteins from different species (e.g., 
myosin light chain, etc.) (Reeck et aL % Cell 50:667, 1987). Such proteins (and their 
encoding genes) have sequence homology, as reflected by their sequence similarity, 

20 whether in terms of percent similarity or the presence of specific residues or motifs at 
conserved positions. 

Accordingly, the term "sequence similarity" in all its grammatical 
forms refers to the degree of identity or correspondence between nucleic acid or 
amino acid sequences of proteins that may or may not share a common evolutionary 

25 origin (see Reeck et al , supra). However, in common usage and in the instant 

application, the term "homologous," when modified with an adverb such as "highly," 
may refer to sequence similarity and may or may not relate to a common evolutionary 
origin. 

In a specific embodiment, two DNA sequences are "substantially 
30 homologous" or "substantially similar" when at least about 80%, and most preferably 
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at least about 90 or 95%, of the nucleotides match over the defined length of the DNA 
sequences, as determined by sequence comparison algorithms, such as BLAST, 
FASTA, DNA Strider, etc. An example of such a sequence is an allelic variant of the 
specific hVLRl genes of the invention. Sequences that are substantially homologous 

5 can be identified by comparing the sequences using standard software available in 
sequence data banks, or in a Southern hybridization experiment under, for example, 
stringent conditions as defined for that particular system. 

Similarly, in a particular embodiment, two amino acid sequences are 
"substantially homologous" or "substantially similar" when greater than 80% of the 

10 amino acids are identical, or greater than about 90% are similar (functionally 
identical). Preferably, the similar or homologous sequences are identified by 
alignment using, for example, the GCG (Genetics Computer Group, Program Manual 
for the GCG Package, Version 7, Madison, Wisconsin) pileup program, or any of the 
programs described above (BLAST, FASTA). 

15 A nucleic acid molecule is "hybridizable" to another nucleic acid 

molecule, such as a cDNA, genomic DNA, or RNA, when a single stranded form of 
the nucleic acid molecule can anneal to the other nucleic acid molecule under the 
appropriate conditions of temperature and solution ionic strength (see Sambrook et 
al, supra). The conditions of temperature and ionic strength determine the 

20 "stringency" of the hybridization. For preliminary screening for homologous nucleic 
acids, low stringency hybridization conditions, corresponding to a T m (melting 
temperature) of 55°C, can be used, e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no 
formamide; or 30% formamide, 5x SSC, 0.5% SDS). Moderate stringency 
hybridization conditions correspond to a higher T m , e.g., 40% formamide, with 5x or 

25 6x SCC. High stringency hybridization conditions correspond to the highest T m , e.g., 
50% formamide, 5x or 6x SCC. SCC is a 0.15M NaCl, 0.015M Na-citrate. 
Hybridization requires that the two nucleic acids contain complementary sequences, 
although depending on the stringency of the hybridization, mismatches between bases 
are possible. The appropriate stringency for hybridizing nucleic acids depends on the 

30 length of the nucleic acids and the degree of complementation, variables well known 
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in the art. The greater the degree of similarity or homology between two nucleotide 
sequences, the greater the value of T m for hybrids of nucleic acids having those 
sequences. The relative stability (corresponding to higher T m ) of nucleic acid 
hybridizations decreases in the following order: RNA:RNA 5 DNArRNA, DNA:DNA. 

5 For hybrids of greater than 1 00 nucleotides in length, equations for calculating T m 
have been derived (see Sambrook et aL % supra, 9.50-9.51). For hybridization with 
shorter nucleic acids, ie. 9 oligonucleotides, the position of mismatches becomes more 
important, and the length of the oligonucleotide determines its specificity (see 
Sambrook et aL, supra, 1 1 .7-1 1 .8). A minimum length for a hybridizable nucleic acid 

10 is at least about 10 nucleotides; preferably at least about 15 nucleotides; and more 
preferably the length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" 
refers to a T m of 55 °C, and utilizes conditions as set forth above. In a preferred 
embodiment, the T m is 60 °C; in a more preferred embodiment, the T m is 65 °C. In a 

15 specific embodiment, "high stringency" refers to hybridization and/or washing 
conditions at 68°C in 0.2XSSC, at 42°C in 50% formamide, 4XSSC, or under 
conditions that afford levels of hybridization equivalent to those observed under either 
of these two conditions. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, 

20 generally of at least 10, preferably at least 15, and more preferably at least 20 
nucleotides, preferably no more than 100 nucleotides, that is hybridizable to a 
genomic DNA molecule, a cDNA molecule, or an mRNA molecule encoding a gene, 
mRNA. cDNA, or other nucleic acid of interest. Oligonucleotides can be labeled, 
e.g., with 32 P-nucleotides or nucleotides to which a label, such as biotin, has been 

25 covalently conjugated. In one embodiment, a labeled oligonucleotide can be used as a 
probe to detect the presence of a nucleic acid. In another embodiment, 
oligonucleotides (one or both of which may be labeled) can be used as PCR primers, 
either for cloning full length or a fragment of hVLRl , or to detect the presence of 
nucleic acids encoding hVLRl. In a further embodiment, an oligonucleotide of the 

30 invention can form a triple helix with a hVLRl DNA molecule. Generally, 
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oligonucleotides are prepared synthetically, preferably on a nucleic acid synthesizer. 
Accordingly, oligonucleotides can be prepared with non-naturally occurring 
phosphoester analog bonds, such as thioester bonds, etc. 

The present invention provides antisense nucleic acids (including 

5 ribozymes), which may be used to inhibit expression of hVLRl of the invention. An 
"antisense nucleic acid" is a single stranded nucleic acid molecule which, on 
hybridizing under cytoplasmic conditions with complementary bases in an RNA or 
DNA molecule, inhibits the latter's role. If the RNA is a messenger RNA transcript, 
the antisense nucleic acid is a countertranscript or mRNA-interfering complementary 

10 nucleic acid. As presently used, "antisense" broadly includes RNA-RNA interactions, 
RNA-DNA interactions, ribozymes and RNase-H mediated arrest. Antisense nucleic 
acid molecules can be encoded by a recombinant gene for expression in a cell (e.g. , 
U.S. Patent No. 5,814,500; U.S. Patent No. 5,811,234), or alternatively they can be 
prepared synthetically (e.g. , U.S. Patent No. 5,780,607). 

15 Specific non-limiting examples of synthetic oligonucleotides 

envisioned for this invention include oligonucleotides that contain phosphorothioates, 
phosphotriesters, methyl phosphonates, short chain alkyl, or cycloalkl intersugar 
linkages or short chain heteroatomic or heterocyclic intersugar linkages. Most 
preferred are those with CH 2 -NH-0-CH 2 , CH 2 -N(CH 3 )-0-CH 2 , CH 2 -0-N(CH 3 )-CH 2 , 

20 CH 2 -N(CH 3 )-N(CH 3 )-CH 2 and 0-N(CH 3 )-CH 2 -CH 2 backbones (where 

phosphodiester is 0-P0 2 -0-CH 2 ). US Patent No. 5,677,437 describes heteroaromatic 
olignucleoside linkages. Nitrogen linkers or groups containing nitrogen can also be 
used to prepare oligonucleotide mimics (U.S. Patents No. 5,792,844 and No. 
5,783,682). US Patent No. 5,637,684 describes phosphoramidate and 

25 phosphorothioamidate oligomeric compounds. Also envisioned are oligonucleotides 
having morpholino backbone structures (U.S. Pat. No. 5,034,506). In other 
embodiments, such as the peptide-nucleic acid (PNA) backbone, the phosphodiester 
backbone of the oligonucleotide may be replaced with a polyamide backbone, the 
bases being bound directly or indirectly to the aza nitrogen atoms of the polyamide 

30 backbone (Nielsen et al, Science 254:1497. 1991). Other synthetic oligonucleotides 
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may contain substituted sugar moieties comprising one of the following at the 2 1 
position: OH, SH, SCH 3 , F, OCN, 0(CH 2 ) n NH 2 or 0(CH 2 ) n CH 3 where n is from 1 to 
about 10; C, to C l0 lower alkyl. substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; 
CF 3 ; OCF 3 ; 0-; S-, or N-alkyl; 0-, S-, or N-alkenyl; SOCH 3 ; SO : CH 3 ; ON0 2 ;N0 2 ; 
5 N 3 ; NH 2 ; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; 
substituted silyl; a fluorescein moiety; an RNA cleaving group; a reporter group; an 
intercalator; a group for improving the pharmacokinetic properties of an 
oligonucleotide; or a group for improving the pharmacodynamic properties of an 
oligonucleotide, and other substituents having similar properties. Oligonucleotides 

10 may also have sugar mimetics such as cyclobutyls or other carbocyclics in place of the 
pentofuranosyl group. Nucleotide units having nucleosides other than adenosine, 
ytidine, guanosine, thymidine and uridine, such as inosine, may be used in an 
oligonucleotide molecule. 

Nucleic Acids Encoding hVLRl Proteins 

15 The present invention contemplates isolation of a gene encoding a 

hVLRl of the invention, including a full length, or naturally occurring form of 
hVLRl, allelic variants and splice variants thereof, and any antigenic fragments 
thereof from any human source. 

A gene encoding hVLRl, whether genomic DNA or cDNA, can be 

20 isolated from any source, particularly from a human cDNA or genomic library. 
Methods for obtaining hVLRl gene are well known in the art, as described above 
(see, e.g., Sambrook et al, 1989, supra). The DNA may be obtained by standard 
procedures known in the art from cloned DNA (e.g., a DNA "library"), and preferably 
is obtained from a cDNA library prepared from tissues with high level expression of 

25 the protein (e.g., an olfactory epithelium library, since these are the cells that evidence 
high levels of expression of hVLRl), by chemical synthesis, by cDNA cloning, or by 
the cloning of genomic DNA, or fragments thereof, purified from the desired cell 
(See, for example, Sambrook et al, 1 989, supra; Glover, D.M. (ed.), 1985, DNA 
Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II). Clones 

30 derived from genomic DNA may contain regulatory DNA regions in addition to 
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coding regions. Whatever the source, the gene may be molecularly cloned into a 
suitable vector for propagation of the gene. Identification of the specific DNA 
fragment containing the desired hVLRl gene may be accomplished in a number of 
ways. For example, a portion of a hVLRl gene exemplified infra can be purified and 

5 labeled to prepare a labeled probe, and the generated DNA may be screened by 

nucleic acid hybridization to the labeled probe (Benton and Davis, Science 196:180, 
1977; Grunstein and Hogness, Proc. Natl. Acad. Sci. U.S.A. 72:3961, 1975). Those 
DNA fragments with substantial homology to the probe, such as an allelic variant 
from another individual, will hybridize. In a specific embodiment, highest stringency 

10 hybridization conditions are used to identify a homologous hVLRl gene. 

Further selection can be carried out on the basis of the properties of the 
gene, e.g., if the gene encodes a protein product having the isoelectric, electrophoretic, 
amino acid composition, partial or complete amino acid sequence, antibody binding 
activity, or ligand binding profile of hVLRl protein as disclosed herein. Thus, the 

1 5 presence of the gene may be detected by assays based on the physical, chemical, 
immunological, or functional properties of its expressed product. 

Identification of Polymorphisms 
The invention specifically contemplates isolating and characterizing 
allelic variants of hVLRl having various polymorphisms, and splice variants. 

20 An individual's polymorphisms pattern can be established, e.g., by 

obtaining DNA from the individual and determining the sequence at a predetermined 
polymorphic position or positions in a gene, or more than one gene. 

The DNA may be obtained from any cell source. Non-limiting 
examples of cell sources available in clinical practice include without limitation blood 

25 cells, buccal cells, cervicovaginal cells, epithelial cells from urine, fetal cells, or any 
cells present in tissue obtained by biopsy. Cells may also be obtained from body 
fluids, including without limitation blood, saliva, sweat, urine, cerebrospinal fluid, 
feces, and tissue exudates at the site of infection or inflammation. DNA is extracted 
from the cell source or body fluid using any of the numerous methods that are 

30 standard in the art. It will be understood that the particular method used to extract 
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DNA will depend on the nature of the source. 

Determination of the sequence of the extracted DNA at polymorphic 
positions is achieved by any means known in the art, including but not limited to 
direct sequencing, hybridization with allele-specific oligonucleotides, allele-specific 

5 PCR, ligase-PCR, HOT cleavage, denaturing gradient gel electrophoresis (DGGE), 
and single-stranded conformational polymorphism (SSCP). Direct sequencing may be 
accomplished by any method, including without limitation chemical sequencing, using 
the Maxam-Gilbert method; by enzymatic sequencing, using the Sanger method; 
mass spectrometry sequencing; and sequencing using a chip-based technology. See, 

10 e.g., Little et al, Genet. Anal. 6:151, 1996. Preferably, DNA from a subject is first 
subjected to amplification by polymerase chain reaction (PCR) using specific 
amplification primers. 

In an alternate embodiment, biopsy tissue is obtained from a subject. 
Antibodies that are capable of distinguishing between different polymorphic forms of 

15 hVLRl are then applied to samples of the tissue to determine the presence or absence 
of a polymorphic form specified by the antibody. The antibodies may be polyclonal 
or monoclonal, preferably monoclonal. Measurement of specific antibody binding to 
cells may be accomplished by any known method, e.g., quantitative flow cytometry, 
or enzyme-linked or fluorescence-linked immunoassay. The presence or absence of a 

20 particular polymorphism and its allelic distribution (i.e. , homozygosity vs. 

heterozygosity) is determined by comparing the values obtained from a patient with 
norms established from populations of patients having known polymorphic patterns. 

In another alternate embodiment, RNA is isolated from biopsy tissue 
using standard methods well known to those of ordinary skill in the art, such as 

25 guanidium thiocyanate-phenol-chloroform extraction (Chomocyznski et al 9 1 987, 
Anal. Biochem., 162:156.) The isolated RNA is then subjected to coupled reverse 
transcription and amplification by polymerase chain reaction (RT-PCR), using 
specific oligonucleotide primers that are specific for a selected polymorphism. 
Conditions for primer annealing are chosen to ensure specific reverse transcription 

30 and amplification; thus, the appearance of an amplification product is diagnostic of 
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the presence of a particular polymorphism. In another embodiment, RNA is reverse- 
transcribed and amplified, after which the amplified sequences are identified by, e.g., 
direct sequencing. In still another embodiment, cDNA obtained from the RNA can be 
cloned and sequenced to identify a polymorphism. 

5 hVLRlAnaloes 

The present invention also relates to cloning vectors containing genes 
encoding analogs and derivatives of hVLRl of the invention, that have the same or 
homologous functional activity as hVLRl. The production and use of derivatives and 
analogs related to hVLRl are within the scope of the present invention. In a specific 

10 embodiment, the derivative or analog is functionally active, /'. e. , capable of exhibiting 
one or more functional activities associated with a full-length, wild-type hVLRl of the 
invention. Such functions include pheromone or other ligand binding, G protein 
binding and activation, and localization to the cell membrane. In another 
embodiment, an hVLRl chimeric construct containing a different cytoplasmic 

15 domain, e.g. , having an intracellular signaling sequence from another receptor protein, 
can be prepared. Other chimeric or fusion proteins are also contemplated. Examples 
include chimeric proteins with G-protein binding domains from the other G protein 
coupled receptors (GPCRs), GFP fusions, epitope tagged proteins, etc. 

hVLRl derivatives can be made by altering encoding nucleic acid 

20 sequences by substitutions, additions or deletions that provide for functionally 
equivalent molecules. In a specific embodiment, infra, a deletion derivative of 
hVLRl is prepared and found to have ligand binding and signal transduction 
properties in the assays used to evaluate the proteins. Preferably, derivatives are made 
that have enhanced or increased functional activity relative to native hVLRl . 

25 Alternatively, such derivatives may encode soluble fragments of hVLRl, or fragments 
of hVLRl that contain the extracellular domain that have the same or greater affinity 
for pheromone-like substrates or other ligands of hVLRl . 

Due to the degeneracy of nucleotide coding sequences, other DNA 
sequences that encode substantially the same amino acid sequence as a hVLRl gene 

30 may be used in the practice of the present invention. These include but are not limited 



WO 01/25431 



PCT/US00/27211 



-31- 

to allelic genes and nucleotide sequences comprising all or portions of hVLRl genes 
which are altered by the substitution of different codons that encode the same amino 
acid residue within the sequence, thus producing a silent change (sequence 
conservative variants). Likewise, the hVLRl derivatives of the invention include, but 

5 are not limited to, those containing, as a primary amino acid sequence, all or part of 
the amino acid sequence of a hVLRl protein including altered sequences in which 
functionally equivalent amino acid residues are substituted for residues within the 
sequence resulting in a conservative amino acid substitution (e.g., functional 
conservative variants). For example, one or more amino acid residues within the 

10 sequence can be substituted by another amino acid of a similar polarity and, if present, 
charge, which acts as a functional equivalent, resulting in a silent alteration. 
Substitutes for an amino acid within the sequence may be selected from other 
members of the class to which the amino acid belongs. For example, the nonpolar 
(hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, 

15 phenylalanine, tryptophan and methionine. Amino acids containing aromatic ring 
structures are phenylalanine, tryptophan, and tyrosine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
positively charged (basic) amino acids include arginine, lysine and histidine. The 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. Such 

20 alterations will not be expected to affect apparent molecular weight as determined by 
polyacrylamide gel electrophoresis, or isoelectric point. Particularly preferred 
substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 
25 - Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free CONH 2 can be maintained. 

Amino acid substitutions may also be introduced to substitute an amino 
acid with a particularly preferable property. For example, a Cys may be introduced a 
potential site for disulfide bridges with another Cys. 
30 The genes encoding hVLRl derivatives and analogs of the invention 
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can be produced by various methods known in the art. The manipulations which 
result in their production can occur at the gene or protein level. For example, the 
cloned hVLRl gene sequence can be modified by any of numerous strategies known 
in the art (Sambrook et al , 1 989, supra). The sequence can be cleaved at appropriate 

5 sites with restriction endonuclease(s), followed by further enzymatic modification if 
desired, isolated, and ligated in vitro. In the production of the gene encoding a 
derivative or analog of hVLRl, care should be taken to ensure that the modified gene 
remains within the same translational reading frame as the hVLRl gene, uninterrupted 
by translational stop signals, in the gene region where the desired activity is encoded. 

1 o Additionally, the hVLRl -encoding nucleic acid sequence can be 

mutated in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
termination sequences, or to create variations in coding regions and/or form new 
restriction endonuclease sites or destroy preexisting ones, to facilitate further in vitro 
modification. In the Examples, infra, such modifications were made to introduce 

15 restriction sites and facilitate cloning the hVLRl gene into an expression vector. Any 
technique for mutagenesis known in the art can be used, including but not limited to, 
in vitro site-directed mutagenesis (Hutchinson, C, et al, J. Biol. Chem. 253:6551, 
1978; Zoller and Smith, DNA 3:479-488, 1984; Oliphant et al, Gene 44:177, 1986; 
Hutchinson et al, Proc. Natl Acad. Sci. U.S.A. 83:710, 1986), use of TAB" linkers 

20 (Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis (see 
Higuchi, 1989, "Using PCR to Engineer DNA", in PCR Technology: Principles and 
Applications for DNA Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61- 
70). 

The identified and isolated gene can then be inserted into an 
25 appropriate cloning vector. A large number of vector-host systems known in the art 
may be used. Possible vectors include, but are not limited to, plasmids or modified 
viruses, but the vector system must be compatible with the host cell used. Examples 
of vectors include, but are not limited to, E. coli, bacteriophages such as lambda 
derivatives, or plasmids such as pBR322 derivatives or pUC plasmid derivatives, e.g.. 
30 pGEX vectors, pmal-c, pFLAG, etc. The insertion into a cloning vector can, for 
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example, be accomplished by ligating the DNA fragment into a cloning vector which 
has complementary cohesive termini. However, if the complementary restriction sites 
used to fragment the DNA are not present in the cloning vector, the ends of the DNA 
molecules may be enzymatically modified. Alternatively, any site desired may be 

5 produced by ligating nucleotide sequences (linkers) onto the DNA termini; these 
ligated linkers may comprise specific chemically synthesized oligonucleotides 
encoding restriction endonuclease recognition sequences. 

Recombinant molecules can be introduced into host cells via 
transformation, transfection, infection, electroporation, etc., so that many copies of the 

10 gene sequence are generated. Preferably, the cloned gene is contained on a shuttle 
vector plasmid, which provides for expansion in a cloning cell, e.g., E. coli, and facile 
purification for subsequent insertion into an appropriate expression cell line, if such is 
desired. For example, a shuttle vector, which is a vector that can replicate in more 
than one type of organism, can be prepared for replication in both E. coli and 

15 Saccharomyces cerevisiae by linking sequences from an E. coli plasmid with 
sequences from the yeast 2\i plasmid. 

Expression of h VLR1 
The nucleotide sequence coding for hVLRl, or antigenic fragment, 
derivative or analog thereof, or a functionally active derivative, including a chimeric 
20 protein, thereof, can be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for the transcription and translation of the 
inserted protein-coding sequence. Thus, the nucleic acid encoding hVLRl of the 
invention is operationally associated with a promoter in an expression vector of the 
invention. Both cDNA and genomic sequences can be cloned and expressed under 
25 control of such regulatory sequences. An expression vector also preferably includes a 
replication origin. 

Alternatively, an hVLRl polypeptide of the invention can be prepared 
using well-known techniques in peptide synthesis, including solid phase synthesis 
(using, e.g., BOC of FMOC chemistry), or peptide condensation techniques. 
30 As used herein, the terms "polypeptide" and "protein" may be used 
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interchangably to refer to the gene product (or corresponding synthetic product) of an 
hVLRl gene. The term "protein" may also refer specifically to the polypeptide as 
expressed in cells. A peptide is generally a fragment of a polypeptide, e.g., of about 
six or more amino acid residues. 

5 The necessary transcriptional and translational signals can be provided 

on a recombinant expression vector, or they may be supplied by the native gene 
encoding hVLRl and/or its flanking regions. 

Potential host-vector systems include but are not limited to mammalian 
cell systems infected with virus {e.g., vaccinia virus, adenovirus, adeno-associated 

10 virus, herpes virus, etc.); insect cell systems infected with virus (e.g. , baculovirus); 
microorganisms such as yeast containing yeast vectors; or bacteria transformed with 
bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression elements of 
vectors vary in their strengths and specificities. Depending on the host-vector system 
utilized, any one of a number of suitable transcription and translation elements may be 

15 used. 

A preferred expression host is a eukaryotic cell (e.g., yeast, insect, or 
mammalian cell). More preferred is a mammalian cell, e.g., human, rat, monkey, dog, 
or hamster cell. In specific embodiments, infra, hVLRl is expressed in a human 
neuroblastoma cell line (e.g., SK-N-MC), or a Chinese hamster ovary cell line (e.g., 
20 CHO 293). 

A recombinant hVLRl protein of the invention, or functional fragment, 
derivative, chimeric construct, or analog thereof, may be expressed chromosomally, 
after integration of the coding sequence by recombination. In this regard, any of a 
number of amplification systems may be used to achieve high levels of stable gene 

25 expression (See Sambrook et al. , 1 989, supra). 

Any of the methods previously described for the insertion of DNA 
fragments into a cloning vector may be used to construct expression vectors 
containing a gene consisting of appropriate transcriptional/translational control signals 
and the protein coding sequences. These methods may include in vitro recombinant 

30 DNA and synthetic techniques and in vivo recombination (genetic recombination). 
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Expression of hVLRl protein may be controlled by any 
promoter/enhancer element known in the art, but these regulatory elements must be 
functional in the host selected for expression. Promoters which may be used to 
control hVLRl gene expression include, but are not limited to, cytomegalovirus 
5 (CMV) promoter (U.S. Patent Nos. 5,385,839 and 5,168,062), the SV40 early 

promoter region (Benoist and Chambon, 1981, Nature 290:304-310), the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto, et al, Cell 
22:787-797, 1980), the herpes thymidine kinase promoter (Wagner et al, Proc. Natl. 
Acad. Sci. U.S.A. 78:1441-1445, 1981), the regulatory sequences of the 

10 metallothionein gene (Brinster et al , Nature 296:39-42, 1 982); prokaryotic expression 
vectors such as the P-lactamase promoter (Villa-Kamaroff, et al, Proc. Natl. Acad. 
Sci. U.S.A. 75:3727-3731, 1978), or the tac promoter (DeBoer, et al, Proc. Natl. 
Acad. Sci. U.S.A. 80:21-25, 1983); see also "Useful proteins from recombinant 
bacteria" in Scientific American, 242:74-94, 1 980; promoter elements from yeast or 

15 other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, 
PGK (phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and the 
animal transcriptional control regions, which exhibit tissue specificity and have been 
utilized in transgenic animals: elastase I gene control region which is active in 
pancreatic acinar cells (Swift et al, Cell 38:639-646, 1984; Ornitz et al, Cold Spring 

20 Harbor Symp. Quant. Biol. 50:399-409, 1986; MacDonald, Hepatology 7:425-515, 
1987); insulin gene control region which is active in pancreatic beta cells (Hanahan, 
Nature 315:1 15-122, 1985), immunoglobulin gene control region which is active in 
lymphoid cells (Grosschedl et al, Cell 38:647-658, 1984; Adames et al, Nature 
318:533-538, 1985; Alexander et al, Mol. Cell. Biol. 7:1436-1444, 1987), mouse 

25 mammary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et al, Cell 45:485-495, 1986), albumin gene control region 
which is active in liver (Pinkert et al, Genes and Devel. 1 :268-276, 1987), alpha- 
fetoprotein gene control region which is active in liver (Krumlauf et al, Mol. Cell. 
Biol. 5:1639-1648, 1985; Hammer etal, Science 235:53-58, 1987), alpha 1- 

30 antitrypsin gene control region which is active in the liver (Kelsey et al , Genes and 
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Devel. 1:161-171, 1987), beta-globin gene control region which is active in myeloid 
cells (Mogram et aL, Nature 315:338-340, 1985; Kollias et aL, Cell 46:89-94, 1986), 
myelin basic protein gene control region which is active in oligodendrocyte cells in 
the brain (Readhead et al 9 Cell 48:703-712, 1987), myosin light chain-2 gene control 

5 region which is active in skeletal muscle (Sani, Nature 314:283-286, 1985), and 
gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et aU Science 234:1372-1378, 1986). 

Expression Vectors 
A wide variety of host/expression vector combinations (i.e., expression 

10 systems) may be employed in expressing the DNA sequences of this invention. 
Useful expression vectors, for example, may consist of segments of chromosomal, 
non-chromosomal and synthetic DNA sequences. Suitable vectors include derivatives 
of SV40 and known bacterial plasmids, e.g., £ coli plasmids col El, pCRl, pBR322, 
pMal-C2, pET, pGEX (Smith et al, Gene 67:31-40, 1988), pMB9 and their 

15 derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous derivatives of 
phage 1, e.g., NM989, and other phage DNA, e.g., Ml 3 and filamentous single 
stranded phage DNA; yeast plasmids such as the 2m plasmid or derivatives thereof; 
vectors useful in eukaryotic cells, such as vectors useful in insect or mammalian cells; 
vectors derived from combinations of plasmids and phage DNAs, such as plasmids 

20 that have been modified to employ phage DNA or other expression control sequences; 
and the like. 

Yeast expression systems can also be used according to the invention 
to express hVLRl . For example, the non-fusion pYES2 vector {Xbal, SphL Shol, 
Notl, GstXl, EcoRl, BstXl, BamW\,Sac\, Kpn\, and HindBl cloning sit; Invitrogen) 
25 or the fusion pYESHis A, B, C (Xbal, Sphl, Shol, Notl BstXl, £coRI, BamH 1 , Sad, 
KpnL and HindlH cloning site, N-terminal peptide purified with ProBond resin and 
cleaved with enterokinase; Invitrogen), to mention just two, can be employed 
according to the invention. 

Preferred vectors, particularly for cellular assays in vitro and in vivo, 
30 are viral vectors, such as lentiviruses, retroviruses, herpes viruses, adenoviruses, 
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adeno-associated viruses, vaccinia virus, baculovirus, and other recombinant viruses 
with desirable cellular tropism. Thus, a gene encoding a functional or mutant protein 
or polypeptide domain fragment thereof can be introduced in vivo, ex vivo, or in vitro 
using a viral vector or through direct introduction of DNA. Expression in targeted 

5 tissues can be effected by targeting the transgenic vector to specific cells, such as with 
a viral vector or a receptor ligand, or by using a tissue-specific promoter, or both. 
Targeted gene delivery is described in International Patent Publication WO 95/28494, 
published October 1995. 

Viral vectors commonly used for in vivo or ex vivo targeting and 

10 therapy procedures are DNA-based vectors and retroviral vectors. Methods for 
constructing and using viral vectors are known in the art {see, e.g., Miller and 
Rosman, BioTechniques, 7:980-990, 1992). Preferably, the viral vectors are 
replication defective, that is, they are unable to replicate autonomously in the target 
cell. Preferably, the replication defective virus is a minimal virus, i.e., it retains only 

15 the sequences of its genome which are necessary for encapsidating the genome to 
produce viral particles. 

DNA viral vectors include an attenuated or defective DNA virus, such 
as but not limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus 
(EBV), adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, 

20 which entirely or almost entirely lack viral genes, are preferred. Defective virus is not 
infective after introduction into a cell. Use of defective viral vectors allows for 
administration to cells in a specific, localized area, without concern that the vector can 
infect other cells. Thus, a specific tissue can be specifically targeted. Examples of 
particular vectors include, but are not limited to, a defective herpes virus 1 (HSV1) 

25 vector (Kaplitt et al, Molec. Cell. Neurosci. 2:320-330, 1991), defective herpes virus 
vector lacking a glyco-protein L gene (Patent Publication RD 371005 A), or other 
defective herpes virus vectors (International Patent Publication No. WO 94/21807, 
published September 29, 1994; International Patent Publication No. WO 92/05263, 
published April 2, 1994); an attenuated adenovirus vector, such as the vector 

30 described by Stratford-Perricaudet et al (J. Clin. Invest. 90:626-630, 1992: see also 
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La Salle et al, Science 259:988-990, 1993); and a defective adeno-associated virus 
vector (Samulski et al 9 J. Virol. 61:3096-3101, 1987; Samulski et ai, J. Virol. 
63:3822-3828, 1989; Lebkowski et a/., Mol. Cell. Biol. 8:3988-3996, 1988). 

Various companies produce viral vectors commercially, including but 
5 by no means limited to Avigen, Inc. (Alameda, CA; AAV vectors), Cell Genesys 
(Foster City, CA; retroviral, adenoviral, AAV vectors, and lentiviral vectors), 
Clontech (retroviral and baculoviral vectors), Genovo, Inc. (Sharon Hill, PA; 
adenoviral and AAV vectors), Genvec (adenoviral vectors), IntroGene (Leiden, 
Netherlands; adenoviral vectors), Molecular Medicine (retroviral, adenoviral, AAV, 

10 and herpes viral vectors), Norgen (adenoviral vectors), Oxford BioMedica (Oxford, 
United Kingdom; lentiviral vectors), and Transgene (Strasbourg, France; adenoviral, 
vaccinia, retroviral, and lentiviral vectors). 

Preferably, for in vivo administration, an appropriate 
immunosuppressive treatment is employed in conjunction with the viral vector, e.g., 

15 adenovirus vector, to avoid immuno-deactivation of the viral vector and transfected 
cells. For example, immunosuppressive cytokines, such as interleukin-12 (IL-12), 
interferon-g (IFN-g), or anti-CD4 antibody, can be administered to block humoral or 
cellular immune responses to the viral vectors (see, e.g., Wilson, Nature Medicine, 
1 995). In that regard, it is advantageous to employ a viral vector that is engineered to 

20 express a minimal number of antigens. 

Adenovirus vectors. Adenoviruses are eukaryotic DNA viruses that 
can be modified to efficiently deliver a nucleic acid of the invention to a variety of cell 
types. Various serotypes of adenovirus exist. Of these serotypes, preference is given, 
within the scope of the present invention, to using type 2 or type 5 human 

25 adenoviruses (Ad 2 or Ad 5) or adenoviruses of animal origin (see W094/26914). 
Those adenoviruses of animal origin which can be used within the scope of the 
present invention include adenoviruses of canine, bovine, murine (example: Mavl, 
Beard et aL, Virology 75 (1990) 81), ovine, porcine, avian, and simian (example: 
SAV) origin. Preferably, the adenovirus of animal origin is a canine adenovirus, more 

30 preferably a CAV2 adenovirus (e.g. Manhattan or A26/61 strain (ATCC VR-800), for 
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example). Various replication defective adenovirus and minimum adenovirus vectors 
have been described (W094/26914, WO95/02697, W094/28938, W094/28152, 
W094/12649, WO95/02697 W096/22378). The replication defective recombinant 
adenoviruses according to the invention can be prepared by any technique known to 
5 the person skilled in the art (Levrero et al , Gene 1 01 : 1 95 1 99 1 ; EP 1 85 573 ; Graham, 
EMBO J. 3:2917, 1984; Graham etal, J. Gen. Virol. 36:59 1977). Recombinant 
adenoviruses are recovered and purified using standard molecular biological 
techniques, which are well known to one of ordinary skill in the art. 

Adeno-associated viruses. The adeno-associated viruses (AAV) are 

10 DNA viruses of relatively small size which can integrate, in a stable and site-specific 
manner, into the genome of the cells which they infect. They are able to infect a wide 
spectrum of cells without inducing any effects on cellular growth, morphology or 
differentiation, and they do not appear to be involved in human pathologies. The 
AAV genome has been cloned, sequenced and characterized. The use of vectors 

15 derived from the AAVs for transferring genes in vitro and in vivo has been described 
(see WO 91/18088; WO 93/09239; US 4,797,368, US 5,139,941, EP 488 528). The 
replication defective recombinant AAVs according to the invention can be prepared 
by cotransfecting a plasmid containing the nucleic acid sequence of interest flanked by 
two AAV inverted terminal repeat (ITR) regions, and a plasmid carrying the AAV 

20 encapsidation genes (rep and cap genes), into a cell line which is infected with a 

human helper virus (for example an adenovirus). The AAV recombinants which are 
produced are then purified by standard techniques. 

Retrovirus vectors. In another embodiment the gene can be introduced 
in a retroviral vector, e.g., as described in Anderson et al, U.S. Patent No. 5,399,346; 

25 Mann et al, 1983, Cell 33:153; Temin et al, U.S. Patent No. 4,650,764; Temin et al, 
U.S. Patent No. 4,980,289; Markowitz et al., 1988, J. Virol. 62:1 120; Temin et al, 
U.S. Patent No. 5,124,263; EP 453242, EP178220; Bernstein et al Genet. Eng. 7 
(1985) 235; McCormick, BioTechnology 3 (1985) 689; International Patent 
Publication No. WO 95/07358, published March 16, 1995, by Dougherty et al; and 

30 Kuo et al, 1993, Blood 82:845. The retroviruses are integrating viruses which infect 
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dividing cells. The retrovirus genome includes two LTRs, an encapsidation sequence 
and three coding regions (gag, pol and env). In recombinant retroviral vectors, the 
gag y pol and env genes are generally deleted, in whole or in part, and replaced with a 
heterologous nucleic acid sequence of interest. These vectors can be constructed from 
5 different types of retrovirus, such as, HIV, MoMuLV ("murine Moloney leukaemia 
virus" MSV ("murine Moloney sarcoma virus"), HaSV ("Harvey sarcoma virus"); 
SNV ("spleen necrosis virus"); RSV ("Rous sarcoma virus") and Friend virus. 
Suitable packaging cell lines have been described in the prior art, in particular the cell 
line PA317 (US 4,861,719); the PsiCRIP cell line (WO 90/02806) and the 

10 GP+envAm-12 cell line (WO 89/07150). In addition, the recombinant retroviral 
vectors can contain modifications within the LTRs for suppressing transcriptional 
activity as well as extensive encapsidation sequences which may include a part of the 
gag gene (Bender et al , J. Virol. 61 : 1639, 1987). Recombinant retroviral vectors are 
purified by standard techniques known to those having ordinary skill in the art. 

1 5 Retroviral vectors can be constructed to function as infectious particles 

or to undergo a single round of transfection. In the former case, the virus is modified 
to retain all of its genes except for those responsible for oncogenic transformation 
properties, and to express the heterologous gene. Non-infectious viral vectors are 
manipulated to destroy the viral packaging signal, but retain the structural genes 

20 required to package the co-introduced virus engineered to contain the heterologous 
gene and the packaging signals. Thus, the viral particles that are produced are not 
capable of producing additional virus. 

Retrovirus vectors can also be introduced by DNA viruses, which 
permits one cycle of retroviral replication and amplifies tranfection efficiency {see 

25 WO 95/22617, WO 95/2641 1 , WO 96/39036, WO 97/19182). 

Lentivirus vectors. In another embodiment, lentiviral vectors are can 
be used as agents for the direct delivery and sustained expression of a transgene in 
several tissue types, including brain, retina, muscle, liver and blood. The vectors can 
efficiently transduce dividing and nondividing cells in these tissues, and maintain 

30 long-term expression of the gene of interest. For a review, see, Naldini, Curr. Opin. 
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BiotechnoL, 9:457-63, 1998; see also Zufferey, et aL, J. Virol., 72:9873-80, 1998). 
Lentiviral packaging cell lines are available and known generally in the art. They 
facilitate the production of high-titer lentivirus vectors for gene therapy. An example 
is a tetracycline-inducible VSV-G pseudotyped lentivirus packaging cell line which 

5 can generate virusparticles at titers greater than 106 IU/ml for at least 3 to 4 days 

(Kafri, et aL, J. Virol., 73: 576-584, 1999). The vector produced by the inducible cell 
line can be concentrated as needed for efficiently transducing nondividing cells in 
vitro and in vivo. 

Non-viral vectors. In another embodiment, the vector can be 

10 introduced in vivo by lipofection, as naked DNA, or with other transfection facilitating 
agents (peptides, polymers, etc.). Synthetic cationic lipids can be used to prepare 
liposomes for in vivo transfection of a gene encoding a marker (Feigner, et. aL, Proc. 
Natl. Acad. Sci. U.S.A. 84:7413-7417, 1987; Feigner and Ringold, Science 337:387- 
388, 1989; see Mackey, et aL, Proc. Natl. Acad. Sci. U.S.A. 85:8027-8031, 1988; 

15 Ulmer et aL, Science 259:1745-1748, 1993). Useful lipid compounds and 

compositions for transfer of nucleic acids are described in International Patent 
Publications W095/18863 and W096/17823, and in U.S. Patent No. 5,459,127. 
Lipids may be chemically coupled to other molecules for the purpose of targeting (see 
Mackey, et. aL, supra). Targeted peptides, e.g., hormones or neurotransmitters, and 

20 proteins such as antibodies, or non-peptide molecules could be coupled to liposomes 
chemically. 

Other molecules are also useful for facilitating transfection of a nucleic 
acid in vivo, such as a cationic oligopeptide (e.g., International Patent Publication 
W095/21931), peptides derived from DNA binding proteins (e.g., International Patent 
25 Publication WO96/25508), or a cationic polymer (e.g., International Patent 
Publication W095/21931). 

It is also possible to introduce the vector in vivo as a naked DNA 
plasmid. Naked DNA vectors for gene therapy can be introduced into the desired host 
cells by methods known in the art, e.g., electroporation, microinjection, cell fusion, 
30 DEAE dextran, calcium phosphate precipitation, use of a gene gun, or use of a DNA 
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vector transporter (see, e.g., Wu et al, J. Biol. Chem. 267:963-967, 1992; Wu and 
Wu, J. Biol. Chem. 263:14621-14624, 1988; Hartmut et al, Canadian Patent 
Application No. 2,012,31 1, filed March 15, 1990; Williams et al., Proc. Natl. Acad. 
Sci. USA 88:2726-2730, 1991). Receptor-mediated DNA delivery approaches can 
5 also be used (Curiel et al, Hum. Gene Ther. 3:147-154, 1992; Wu and Wu, J. Biol. 
Chem. 262:4429-4432, 1987). US Patent Nos. 5,580,859 and 5,589,466 disclose 
delivery of exogenous DNA sequences, free of transfection facilitating agents, in a 
mammal. Recently, a relatively low voltage, high efficiency in vivo DNA transfer 
echnique, termed electrotransfer, has been described (Mir et al, CP. Acad. Sci., 

10 321:893, 1998; WO 99/01157; WO 99/01158; WO 99/01175). 

Antibodies to hVLRl 
According to the invention, hVLRl polypeptides produced 
recombinantly or by chemical synthesis, and fragments or other derivatives or analogs 
thereof, including fusion proteins, may be used as an immunogen to generate 

15 antibodies that recognize the hVLRl polypeptide. Such antibodies include but are not 
limited to polyclonal, monoclonal, chimeric, single chain, Fab fragments, and an Fab 
expression library. Such an antibody is specific for human hVLRl . 

Various procedures known in the art may be used for the production of 
polyclonal antibodies to hVLRl polypeptide or derivative or analog thereof. For the 

20 production of antibody, various host animals can be immunized by injection with the 
hVLRl polypeptide, or a derivative (e.g., fragment or fusion protein) thereof, 
including but not limited to rabbits, mice, rats, sheep, goats, etc. In one embodiment, 
the hVLRl polypeptide or fragment thereof can be conjugated to an immunogenic 
carrier, e.g., bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). 

25 Various adjuvants may be used to increase the immunological response, depending on 
the host species, including but not limited to Freund's (complete and incomplete), 
mineral gels such as aluminum hydroxide, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG 

30 (bacille Calmette-Guerin) and Corynebacterium parvum. 
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For preparation of monoclonal antibodies directed toward the hVLRl 
polypeptide, or fragment, analog, or derivative thereof, any technique that provides for 
the production of antibody molecules by continuous cell lines in culture may be used. 
These include but are not limited to the hybridoma technique originally developed by 
5 Kohler and Milstein (Nature 256:495-497, 1975), as well as the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al, Immunology Today 4:72, 1983; 
Cote et al, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030, 1983), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96, 1985). 

10 Production of human antibodies by CDR grafting is described in U.S. Patent Nos. 
5,585,089, 5,693,761, and 5,693,762 to Queen et al, and also in U.S. Patent No. 
5,225,539 to Winter and International Patent Application PCT/W09 1/09967 by Adau 
et al In an additional embodiment of the invention, monoclonal antibodies can be 
produced in germ-free animals (International Patent Publication No. WO 89/12690, 

15 published 28 December 1989). In fact, according to the invention, techniques 

developed for the production of "chimeric antibodies" (Morrison et al , J. Bacteriol. 
1 59:870, 1 984); Neuberger et al , Nature 3 1 2:604-608, 1 984; Takeda et al , Nature 
3 14:452-454, 1985) by splicing the genes from a mouse antibody molecule specific 
for an hVLRl polypeptide together with genes from a human antibody molecule of 

20 appropriate biological activity can be used; such antibodies are within the scope of 
this invention. Such human or humanized chimeric antibodies are preferred for use in 
therapy of human diseases or disorders (described infra), since the human or 
humanized antibodies are much less likely than xenogenic antibodies to induce an 
immune response, in particular an allergic response, themselves. 

25 According to the invention, techniques described for the production of 

single chain antibodies (U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. 
Patent 4,946,778) can be adapted to produce hVLRl polypeptide-specific single chain 
antibodies. An additional embodiment of the invention utilizes the techniques 
described for the construction of Fab expression libraries (Huse et al, Science 

30 246:1275-128h 1989) to allow rapid and easy identification of monoclonal Fab 
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fragments with the desired specificity for an hVLRl polypeptide, or its derivatives, or 
analogs. 

Antibody fragments which contain the idiotype of the antibody 
molecule can be generated by known techniques. For example, such fragments include 
5 but are not limited to: the F(ab') 2 fragment which can be produced by pepsin digestion 
of the antibody molecule; the Fab' fragments which can be generated by reducing the 
disulfide bridges of the F(ab') 2 fragment, and the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody can 

10 be accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA 
(enzyme-linked immunosorbant assay), "sandwich" immunoassays, 
immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion 
assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, for 
example), western blots, precipitation reactions, agglutination assays (e.g., gel 

15 agglutination assays, hemagglutination assays), complement fixation assays, 

immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc. 
In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting 
binding of a secondary antibody or reagent to the primary antibody. In a further 

20 embodiment, the secondary antibody is labeled. Many means are known in the art for 
detecting binding in an immunoassay and are within the scope of the present 
invention. For example, to select antibodies which recognize a specific epitope of an 
hVLRl polypeptide, one may assay generated hybridomas for a product which binds 
to an hVLRl polypeptide fragment containing such epitope. For selection of an 

25 antibody specific to an hVLRl polypeptide from a particular species of animal, one 
can select on the basis of positive binding with hVLRl polypeptide expressed by or 
isolated from cells of that species of animal. 

The foregoing antibodies can be used in methods known in the art 
relating to the localization and activity of the hVLRl polypeptide, e.g., for Western 

30 blotting, imaging hVLRl polypeptide in situ, measuring levels thereof in appropriate 
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physiological samples, etc. using any of the detection techniques mentioned above or 
known in the art. Such antibodies can also be used in assays for ligand binding, e.g. , 
as described in U.S. Patent No. 5,679,582. 

In a specific embodiment, antibodies that agonize or antagonize the 
5 activity of hVLRl polypeptide can be generated. Such antibodies can be tested using 
the assays described infra for identifying ligands. 

Detection of hVLRl Expression 
One of ordinary skill in the art can use hVLRl -specific 
oligonucleotides (PCR primers and probes) to detect expression of hVLRl mRNA. 
10 Expression can be detected by Northern analysis or by reverse transcriptase-PCR (RT- 
PCR). Alternatively, mRNA can be detected by expression of the encoded protein, 
e.g., in a reticulocyte assay, or by making cDNA and expressing it in aXenopous 
oocyte assay. However, these latter techniques are more cumbersome and difficult; 
the Northern or RT-PCR analysis is preferred. 
15 Similarly, one can use antibodies to hVLRl to detect expression by 

immunoassay. For example, immunohistology techniques permit detection of 
expression of hVLRl receptor. By manipulating the assay conditions, one can 
distinguish extracellular and intracellular expression. Antibodies for immunodetection 
of hVLRl may be specific for hVLRl or for a tag fused to the hVLRl protein. In 
20 addition, it is possible to use biochemical techniques, such as ligand binding affinity, 
to establish that the functional hVLRl protein is present on cells. 

For either technique, cells for testing can be obtained from nasal 
tissues, particularly vomeronasal tissues, from human biopsy or surgical procedures 
(e.g., rhinoplasties). Alternatively, the cells can be obtained from transgenic animals, 
25 and these techniques permit evaluation of hVLRl expression in the animals. These 
techniques may also be used to detect hVLRl expression in tissue culture. 

Screening and Chemistry 
Identification and isolation of hVLRl provides for development of 
screening assays, particularly for high throughput screening of molecules that up- or 
30 down-regulate the activity of hVLRl, e.g., by permitting expression of hVLRl in 
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quantities greater than can be isolated from natural sources, or in indicator cells that 
are specially engineered to indicate the activity of hVLRl expressed after transfection 
or transformation of the cells. Accordingly, the present invention contemplates 
methods for identifying specific ligands of hVLRl using various screening assays 
5 known in the art. Furthermore, the invention permits identification of ligands that 
selectively bind hVLRl to a greater degree than to other pheromone-like receptors. 

Any screening technique known in the art can be used to screen for 
hVLRl agonists or antagonists. The present invention contemplates screens for small 
molecule ligands or ligand analogs and mimics, as well as screens for natural ligands 

10 that bind to and agonize or antagonize the activity of hVLRl in vivo. For example, 
natural products libraries can be screened using assays of the invention for molecules 
that agonize or antagonize hVLRl activity. Generally, compounds are tested for the 
ability to compete with labeled pheromone-like substrate or a pheromone-like analog, 
for binding to the hVLRl . Screens can either be "cell-free", /. e. , binding assays of 

15 receptor protein with candidate compounds, where the protein is on a solid support in 
a liposome or micelle, or cell-based, in which the protein is found in a cell membrane. 
It is also possible to directly label the test compound to evaluate 

binding. 

Knowledge of the primary sequence of the protein, and the similarity of 
20 that sequence with proteins of known function, can provide an initial clue as to the 
inhibitors or antagonists of the protein. Identification and screening of antagonists is 
further facilitated by determining structural features of the protein, e.g., using X-ray 
crystallography, neutron diffraction, nuclear magnetic resonance spectrometry, and 
other techniques for structure determination. These techniques provide for the 
25 rational design or identification of agonists and antagonists. 

Another approach uses recombinant bacteriophage to produce large 
libraries. Using the "phage method" (Scott and Smith, Science 249:386-390, 1990; 
Cwirla, et al, Proc. Natl. Acad. Sci., 87:6378-6382, 1990; Devlin et a/., Science, 
49:404-406, 1990), very large libraries can be constructed (10 6 -10 8 chemical entities). 
30 A second approach uses primarily chemical methods, of which the Geysen method 
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(Geysen et al, Molecular Immunology 23:709-71 5, 1986; Geysen et al J. 
Immunologic Method 102:259-274, 1987; and the method of Fodor et al (Science 
251:767-773, 1991) are examples. Furka et al (14th International Congress of 
Biochemistry, Volume #5, Abstract FR:013, 1988; Furka, Int. J. Peptide Protein Res. 

5 37:487-493, 1991), Houghton (U.S. Patent No. 4,631,21 1, issued December 1986) 
and Rutter et al (U.S. Patent No. 5,010,175, issued April 23, 1991) describe methods 
to produce a mixture of peptides that can be tested as agonists or antagonists. 

In another aspect, synthetic libraries (Needels et al, Proc. Natl. Acad. 
Sci. USA 90:10700-4, 1993; Ohlmeyer et al, Proc. Natl. Acad. Sci. USA 90:10922- 

10 10926, 1993; Lam et al, International Patent Publication No. WO 92/00252; Kocis et 
al, International Patent Publication No. WO 9428028) and the like can be used to 
screen for hVLRl ligands according to the present invention. 

In another embodiment, a yeast screening assay, useful for testing 
agonists and antagonists of mammalian G-protein coupled receptors, e.g., as disclosed 

15 in U.S. Patent No. 5,482,832, can be used. 

Test compounds are screened from large libraries of synthetic or 
natural compounds. Numerous means are currently used for random and directed 
synthesis of saccharide, peptide, and nucleic acid based compounds. Synthetic 
compound libraries are commercially available from Maybridge Chemical Co. 

20 (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates 

(Merrimack, NH), and Microsource (New Milford, CT). A rare chemical library is 
available from Aldrich (Milwaukee, WI). Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant and animal extracts are available 
from e.g. Pan Laboratories (Bothell, WA) or MycoSearch (NC), or are readily 

25 producible. Additionally, natural and synthetically produced libraries and compounds 
are readily modified through conventional chemical, physical, and biochemical means 
(Blondelle et al, Tib Tech, 14:60, 1996). 
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In vivo and ex vivo screening methods 
Intact cells or whole animals expressing a gene encoding hVLRl can 
be used in screening methods to identify candidate drugs. 

In one series of embodiments, a permanent cell line is established. 
5 Alternatively, cells (including without limitation mammalian, insect, yeast, or 

bacterial cells) are transiently programmed to express an hVLRl gene by introduction 
of appropriate DNA or mRNA. Identification of candidate compounds can be 
achieved using any suitable assay, including without limitation (i) assays that measure 
selective binding of test compounds to hVLRl (ii) assays that measure the ability of a 

10 test compound to modify (i.e., inhibit or enhance) a measurable activity or function of 
hVLRl and (iii) assays that measure the ability of a compound to modify (i.e., inhibit 
or enhance) the transcriptional activity of sequences derived from the promoter (i.e., 
regulatory) regions the hVLRl gene. 

Identification of ligands for olfactory receptors can be achieved by 

15 screening for functional expression of receptors, e.g. see Krautwurst et al, Cell 
95:917-926, 1998. 

Gene targeting technology to introduce mutations in putative 
pheromone receptor genes. By generating alleles differentially tagged with the 
histological markers, the putative pheromone receptor gene expression pattern can be 

20 detected topographically. These histological markers combines the intrinsic 
fluorescent properties of green fluorescent protein (GFP) with the microtubule- 
binding properties of the tau protein, assuring that the fluorescent label will be 
exported down the axons to the axon terminals of the vomeronasal sensory neurons 
that express the marker-tagged receptor genes. A deletion allele of the putative 

25 receptor gene determines whether expression of the putative receptor gene is a 
determinant of the pattern of axonal projections. The intronless coding region is 
excised from start to stop codon and replaced with a GFP-IRES-taulacZ cassette. To 
determine whether other seven-transmembrane proteins can substitute for the putative 
receptor gene, the coding region of the putative receptor gene can be replaced with 

30 that of a known odorant receptor gene, thus generating a swap. The consequence of 
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this swap on the topography of the projections can be examined by visualizing the 
taulacZ expression (Rodriquez et al, Cell 97:199-208, 1999). 

Transgenic mammals can be prepared for evaluating the molecular 
mechanisms of hVLRl . Such mammals provide excellent models for screening or 

5 testing drug candidates. Thus, hVLRl "knock-in" mammals can be prepared for 
evaluating the molecular biology of this system in greater detail than is possible with 
human subjects. It is also possible to evaluate compounds or diseases on "knockout" 
animals, e.g. , to identify a compound that can compensate for a defect in hVLRl . 
Both technologies permit manipulation of single units of genetic information in their 

10 natural position in a cell genome and to examine the results of that manipulation in 
the background of a terminally differentiated organism. 

A "knockin" mammal is a mammal in which an endogenous gene is 
substituted with a heterologous gene (Roemer et al, New Biol. 3:331, 1991). 
Preferably, the heterologous gene is "knocked-in" to a locus of interest, either the 

15 subject of evaluation(in which case the gene may be a reporter gene; see Elefanty et 
al, Proc Natl Acad Sci USA 95:1 1897,1998) of expression or function of a 
homologous gene, thereby linking the heterologous gene expression to transcription 
from the appropriate promoter. This can be achieved by homologous recombination, 
transposon (Westphal and Leder, Curr Biol 7:530, 1997), using mutant recombination 

20 sites (Araki et al , Nucleic Acids Res 25:868, 1 997) or PCR (Zhang and Henderson, 
Biotechniques 25:784, 1998). 

A "knockout mammaT is an mammal (e.g., mouse) that contains 
within its genome a specific gene that has been inactivated by the method of gene 
targeting (see, e.g., US Patents No. 5,777,195 and No. 5,616,491). A knockout 

25 mammal includes both a heterozygote knockout (i.e., one defective allele and one 

wild-type allele) and a homozygous mutant (i.e., two defective alleles). Preparation of 
a knockout mammal requires first introducing a nucleic acid construct that will be 
used to suppress expression of a particular gene into an undifferentiated cell type 
termed an embryonic stem cell. This cell is then injected into a mammalian embryo. 

30 A mammalian embryo with an integrated cell is then implanted into a foster mother 
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for the duration of gestation. Zhou, et al (Genes and Development, 9:2623-34, 1995) 
describes PPCA knock-out mice. 

The term "knockout" refers to partial or complete suppression of the 
expression of at least a portion of a protein encoded by an endogenous DNA sequence 
5 in a cell. The term "knockout construct" refers to a nucleic acid sequence that is 

designed to decrease or suppress expression of a protein encoded by endogenous DNA 
sequences in a cell. The nucleic acid sequence used as the knockout construct is 
typically comprised of (1) DNA from some portion of the gene (exon sequence, intron 
sequence, and/or promoter sequence) to be suppressed and (2) a marker sequence used 

10 to detect the presence of the knockout construct in the cell. The knockout construct is 
inserted into a cell, and integrates with the genomic DNA of the cell in such a position 
so as to prevent or interrupt transcription of the native DNA sequence. Such insertion 
usually occurs by homologous recombination (i.e., regions of the knockout construct 
that are homologous to endogenous DNA sequences hybridize to each other when the 

15 knockout construct is inserted into the cell and recombine so that the knockout 

construct is incorporated into the corresponding position of the endogenous DNA). 
The knockout construct nucleic acid sequence may comprise 1) a full or partial 
sequence of one or more exons and/or introns of the gene to be suppressed, 2) a full or 
partial promoter sequence of the gene to be suppressed, or 3) combinations thereof. 

20 Typically, the knockout construct is inserted into an embryonic stem cell (ES cell) and 
is integrated into the ES cell genomic DNA, usually by the process of homologous 
recombination. This ES cell is then injected into, and integrates with, the developing 
embryo. 

The phrases "disruption of the gene" and "gene disruption" refer to 
25 insertion of a nucleic acid sequence into one region of the native DNA sequence 

(usually one or more exons) and/or the promoter region of a gene so as to decrease or 
prevent expression of that gene in the cell as compared to the wild-type or naturally 
occurring sequence of the gene. By way of example, a nucleic acid construct can be 
prepared containing a DNA sequence encoding an antibiotic resistance gene which is 
30 inserted into the DNA sequence that is complementary to the DNA sequence 



WO 01/25431 



PCT/US00/27211 



-51- 

(promoter and/or coding region) to be disrupted. When this nucleic acid construct is 
then transfected into a cell, the construct will integrate into the genomic DNA. Thus, 
many progeny of the cell will no longer express the gene at least in some cells, or will 
express it at a decreased level, as the DNA is now disrupted by the antibiotic 
5 resistance gene. 

Generally, the DNA will be at least about 1 kilobase (kb) in length and 
preferably 3-4 kb in length, thereby providing sufficient complementary sequence for 
recombination when the knockout construct is introduced into the genomic DNA of 
the ES cell (discussed below). 

10 Included within the scope of this invention is a mammal in which two 

or more genes have been knocked out. Such mammals can be generated by repeating 
the procedures set forth herein for generating each knockout construct, or by breeding 
to mammals, each with a single gene knocked out, to each other, and screening for 
those with the double knockout genotype. 

1 5 Regulated knockout animals can be prepared using various systems, 

such as the tet-repressor system {see US Patent No. 5,654,168) or the Cre-Lox system 
{see US Patents No. 4,959,317 and No. 5,801,030). 

In another series of embodiments, transgenic animals are created in 
which (i) a human hVLRl is stably inserted into the genome of the transgenic animal; 

20 and/or (ii) the endogenous vomeronasal receptor genes are inactivated and replaced 
with human hVLRl genes. See, e.g., Coffman, Semin. Nephrol. 17:404, 1997; Esther 
et ai, Lab. Invest. 74:953, 1996; Murakami et a!. 9 Blood Press. Suppl. 2:36, 1996. 
Such animals can be treated with candidate compounds and monitored for responses 
associated with pheromone stimulation. 

25 Hi gh-Throughput Screen 

Agents according to the invention may be identified by screening in 
high-throughput assays, including without limitation cell-based or cell-free assays. It 
will be appreciated by those skilled in the art that different types of assays can be used 
to detect different types of agents. Several methods of automated assays have been 

30 developed in recent years so as to permit screening of tens of thousands of compounds 
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in a short period of time, such as assays based on protein stability when contacted 
with a candidate ligand (see U.S. Patent Nos. 5,679,582 and 5,585,277). Such high- 
throughput screening methods are particularly preferred. The use of high-throughput 
screening assays to test for agents is greatly facilitated by the availability of large 
5 amounts of purified polypeptides, as provided by the invention. 

Preferred Screening Methods 
There are several screening methods available for the discovery of non- 
peptide antagonists. These screens include radio ligand binding, signal transduction, 
yeast expression, reporter assays, and structure function of existing peptide agonists 

10 and non-peptide antagonists. The hVLRl expressed in yeast displays 

pharmacological properties similar to that observed for this receptor when expressed 
in mammalian cells. The utilization of yeast as a screening tool can accelerate the 
search for novel pheromone analogs. This technology can be utilized for screening 
of novel compounds that are identified in high throughput screens. 

15 Signal Transduction Assays 

G protein coupled receptors (GPCR) are coupled to a variety of 
heterotrimeric G proteins, which are comprised of a, P, and y subunits. Upon agonist 
binding to a GPCR at the cell surface, conformational changes occur within the 
agonist:GPCR complex, which lead to the dissociation of the G protein a subunit 

20 from the Py subunits. The G a and G^ subunits then stimulate a variety of intracellular 
effectors, which transduce the extracellular signal to the inside of the cell. Various 
signal transduction systems known to be coupled to GPCRs include adenylate cyclase, 
phospholipase C, phospholipase A 2 , sodium/hydrogen exchange, etc. Thus, 
measurements of intracellular calcium concentrations and adenylate cyclase activity 

25 indicate whether a hit or test compound is functionally behaving as an agonist or 
antagonist at the vomeronasal-iike receptor. 

In a specific embodiment, G-protein signal transduction is coupled to 
expression of a reporter gene, thus permitting a reporter gene screening assay. 
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Calcium Mobilization Assay 
Whole cells expressing the vomeronasal-like receptor are loaded with a 
fluorescent dye that chelates calcium ions, such as FURA-2. Upon addition of 
pheromone-like substrate to these cells, pheromone-like substrate binds to the 
5 vomeronasal-like receptors and calcium is released from the intracellular stores. The 
dye chelates these calcium ions. Spectrophotometric determination of the ratio for 
dyeicalcium complexes to free dye determine the changes in intracellular calcium 
concentrations upon addition of pheromone-like substrate. Hits from screens and 
other test compounds can be similarly tested in this assay to functionally characterize 

10 them as agonists or antagonists. Increases in intracellular calcium concentrations are 
expected for compounds with agonist activity while compounds with antagonist 
activity are expected to block pheromone-like substrate stimulated increases in 
intracellular calcium concentrations. 

Cyclic AMP Accumulation Assay 

15 Upon agonist binding, G s coupled GPCRs stimulate adenylate cyclase. 

Adenylate cyclase catalyzes the production of cyclic AMP from 
adenosine-5-triphosphate which, in turn, activates protein kinases. G ( coupled 
GPCRs are also coupled to adenylate cyclase, however, agonist binding to these 
receptors results in the inhibition of adenylate cyclase and the subsequent inhibition of 

20 cAMP. To measure the inhibition of cAMP accumulation, cells expressing G t 

coupled receptors must first be stimulated to elevate cAMP levels. This is achieved 
by treating the cells with forskolin, a diterpene that directly stimulates cAMP 
production. Co-incubation of cells expressing Gj coupled receptors with forskolin and 
a functional agonist will result in the inhibition of forskolin-stimulated cAMP 

25 accumulation. For a cAMP assay, whole cells stably expressing hVLRl can be 

incubated with a test compound, and with forskolin plus a test compound. The cells 
are then lysed and cAMP levels are measured using the [ 125 I]cAMP scintillation 
proximity assay (SPA). Functional agonists of G s coupled receptors are expected to 
increase cAMP levels above basal levels whereas functional agonists of G ( coupled 

30 receptors are expected to inhibit the forskolin-stimulated cAMP accumulation. 
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EXAMPLES 

The present invention will be better understood by reference to the 
following Examples, which are provided as exemplary of the invention, and not by 
way of limitation. 

5 

EXAMPLE 1: Isolation of Human VomeronasaMike Receptor Sequences 

In an attempt to clone vomeronasal-like receptors from human, we 
designed degenerate primers specific for selected conserved regions in mouse and rat 
vomeronasal receptor (VR) genes. This approach was chosen to favor the 
10 amplification of VR-like sequences with intact open reading frames (ORFs), 
reasoning that pseudogenes would differ at these conserved motifs more than 
functional genes. Five conserved regions were chosen and 8 primers were synthesized, 
allowing for 12 different primer pairs to be tested. 
The sense primers are: 

1 5 TAC32: CTI AGY CCI AGR AGY TCI TG (SEQ ID NO:5) 

TAC33 : ATM GCI ACI CCI AAY TR AC (SEQ ID NO:6) 

TAC145: AAR GCI TCI CCI GAR CAR AGR GCI AC (SEQ ID NO:7) 

The reverse primers are: 
TAC35: ARI ARI GCI ACC ATR TAI C (SEQ ID NO:8) 

20 TAC36: CKI GTI GCY CTY TGY TCI GG (SEQ ID NO:9) 

TAC143: ACR AAI GGR CTI ACI GTI GCR TA (SEQ ID NO:10) 

TAC143': TCR GGI AAR CAI TAD WSI TG (SEQ ID NO: 1 1) 

TAC144: ARI ATI GTY CTI GTI GCY CTY TG. (SEQ ID NO:12) 

Although these primers amplify multiple mouse VR genes, a single 
25 band is observed when the PCR product is separated on an agarose gel, because all 
VR coding sequences are contained in a single exon and have a similar size. 

Human genomic DNA from a Caucasian male was then used as 
template. PCR conditions were: 94°C for 1 min, 48*C for 3 min, 72*C for 3 min, 39 
cycles with 6 sec extension per cycle. Amplified PCR products, which were similar in 
30 size to those obtained from mouse genomic DNA, were isolated and cloned into 
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pGEMT. Three pairs of primers allowed the amplification of potential human VR 
receptor genes, and 5 clones for each of the amplified products were subcloned and 
sequenced. The sequenced products corresponding to potential human VRs were then 
used as templates to synthesize radioactive probes for screening a human genomic 
5 BAC library. 

In a parallel approach, a pair of degenerate primers was used to amplify 
mouse genomic DNA. 23 different mouse VR receptor partial sequences were cloned 
and sequenced from this amplified product. This heterogenous product was then used 
as a template for a probe to screen the human BAC library, at medium stringency 
10 (59'C). 

From both approaches, 27 human BAC clones were selected, from 
which 8 distinct subclones containing sequences homologous to mouse or rat VR 
genes were isolated. One human sequence, termed Bh33, has a complete ORF 
potentially encoding for a protein similar to the mouse or rat VRs from the VR1 

15 family (Figure 1). Subsequently, it was determined that hVLRl might have an 

alternative expression form using an upstream ATG start site in reading frame with 
the putative start site shown in Figure 1. This "long" form is shown in Figure 2. The 
Bh33 deduced amino acid sequence is 28% identical and 47% similar to the mouse 
VR^2 sequence, and many of the conserved amino acid residues in rat and mouse VRs 

20 are also conserved in Bh33 (see Figure 3). 

The seven other BAC subclones were human pseudogenes, as they 
contained multiple frameshifts and stop codons in the coding sequence (Figure 4). 

To find potential polymorphisms in humans, the Bh33 coding sequence 
was cloned and sequenced from four Caucasian males and individuals of Indonesian, 

25 Pygmy, Amerindian, Cambodian, Japanese, Ami, and Adygei extractions. The 
sequences from the four Caucasians were identical, and two single nucleotide 
substitutions were found in the Indonesian, Pygmy, Japanese, Cambodian, and Ami 
subjects, resulting in two amino acid differences (S201 F, A229 -* D). The 
Amerindian and Adygei subjects each had the S201 F substitution. None of the 

30 allelic variants were found to be interrupted by a stop codon or to contain a frameshift. 
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EXAMPLE 2: Detection of hVLRl In Human Vomeronasal Tissue 

In order to explore the expression of the BH33 gene in the human 
olfactory system, sensory olfactory epithelium was extracted from patients, RNA 
isolated from the tissues, and cDNA generated from this RNA. As expected, human 
5 olfactory epithelium expressed hVLRl mRNAs. 

Using specific primers on the BH33 coding sequence and an "adapter 
oligonucleotide", the 5' untranslated sequence of BH33 was cloned by 5'RACE. The 
sequence of the 5 f untranslated region showed, when compared to the genomic 
sequence of the BH33 locus, that the BH33 gene consists of three exons (see Figure 
10 5B) and that the coding sequence is entirely included in the last exon. The predicted 
open reading frame would encode for a 313 amino acids seven-transmembrane protein 
(see Figure 5B). 

These data indicated that, in addition of having a full length 

vomeronasal 

15 receptor-like open reading frame, BH33 has functional splice donor and acceptor sites 
and that the promoter is also intact. It also demonstrates that the BH33 gene is 
expressed in human olfactory epithelium. 

Another BH33 splice variant was also found to be expressed in the 
olfactory epithelium, which seemed to use a cryptic splice site inside the BH33 coding 

20 sequence, and which used a fourth exon (see Figure 5B). The potential protein 

product of this splice variant would have 243 amino acids and would share homology 
to vomeronasal receptors only on the two last thirds of its sequence. 

EXAMPLE 3: Knock-In ofhVLRl 

25 In order to explore the potential of the hVLRl to substitute for a mouse 

vomeronasal receptor, the coding sequence of the mouse VR2 receptor is substituted 
by the coding sequence of hVLRl and green fluorescent protein under control of an 
internal ribosome expression site (IRES), as was done previously with another 
receptor system (Rodriguez el al, Cell, 1999, 97:199-208). This procedure is shown 

30 schematically in Figure 6. The projection pattern of the neurons expressing the 
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swapped receptor are analyzed. The neurons expressing the human receptor are also 
easily detected as they will express green flourescent protein (GFP). This will allow 
the study of hVLRl ligands by following single cell events, e.g., calcium fluxes. 

5 EXAMPLE 4: hVLRl Polymorphic Variants 

In order to investigate the presence of hVLRl orthologs in non-human 
primates, a V1LR1 ortholog was cloned from chimpanzee genomic DNA. 

Human genomic DNAs from different ethnic backgrounds were 
obtained from the Coriell Cell Repositories (New Jersey). Genomic DNA from 
10 chimpanzee was extracted from blood obtained from the Yerkes Regional Primate 
Center (Atlanta, Georgia). Primers used for the amplification of the human and the 
chimpanzee sequences flanked the VI LR1 ORF, and are as follows: TAC558: TTC 
TCT GCA GTT GGA CAC ACA AGC (SEQ ID NO: 1 3), and TAC559: GCA AGA 
GTT ATG ATA AAT AGC TG (SEQ ID NO: 14). 
15 The deduced amino acid sequences from two human V1RL1 splice 

variants (a and b) were aligned with the putative chimpanzee ortholog (cVlRLl or 
cVRLl) (SEQ ID NO: 17) and the two mouse VI R sequences, mVR23 (SEQ ID 
NO:18) and mpr2 (SEQ ID NO:19) (Genbank accession number Y12724). The cloned 
V1LR1 ortholog coding sequence (CDS) codes for a protein sharing 93% identity with 
20 VLR1 , diverging mainly at the C-terminus (Fig. 7). Again, the 1 1 VlR-conserved 
residues and potential N-linked glycosylation site are present within cVLRl . 

EXAMPLE 5: Tissue Specificity Of hVLRl Expression 

In order to determine the tissue-specificity of hVLRl expression, a 
25 wide range of tissues and organs were screened by RT-PCR analysis by hybridization 
with a V1LR1 probe (Fig. 8). 

Primers used for the amplification of hVLRl cDNA in different human 
tissue samples (Human Multiple Tissue cDNA Panels, Clontach) were located in 
hVLRl exons 2 and 4: TAC588: GTT CCC ATG AAC TCA GAA G (SEQ ID 
30 NO:13) and TAC597: TGG CTG AGA ATC AAG TCC GT (SEQ ID NO:14). 
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Subsequent hybridizations were performed with a probe corresponding to the hVLRl 
mRNA sequence flanked by TAC588 and TAC597, which includes part of the CDS. 
For the evaluation of the expression level of hVLRl mRNA, a human multiple tissue 
expression array (MTe, Clonetech) was employed. 
5 hVLRl mRNA expression was found consistently found in the 

olfactory mucosa. In contrast, mRNA expression was detected with poor 
reproducibility in the brain, lung and kidney, probably reflecting very low expression 
levels in these tissues. (Due to its very high sensitivity, this method does not provide 
more than qualitative information regarding the levels of hVLRl mRNA in a given 
10 tissue.) 

Potential expression of hVLRl mRNA in non-olfactory tissues was 
further investigated by dot blot analysis, with a commercially available panel of 76 
human polyA* mRNAs pools, each corresponding to a different tissue. The hVLRl 
probe did not detectably hybridize to any sample tested except for a very weak signal 

1 5 with adult cerebellum mRNA (data not shown). The variable presence of hVLRl 

transcripts outside the olfactory system is not surprising as in vertebrates, expression 
of odorant receptor mRNAs has also been reported in non-olfactory tissues such as 
testis (Parmentier, M. et al. Nature 355, 453-455,1992), heart Drutel, G. et al. 
Receptors Channels 3, 33-40, 1995), spleen (Blache, P., et al., D. Biochem. Biophys. 

20 Res. Commun. 242, 669-672, 1998) and notochord (Nef, S.,et al., Mech. Dev. 55, 
65-77,1996); the significance of this ectopic expression is unclear. 

Redundant samples of olfactory mucosa from three adult patients were 
removed during elective surgery for septodermoplasty, following approval of the 
Institutional Review Board at The Rockefeller University and of the Yale University 

25 School of Medicine. The indication for surgery was recurrent nasal bleeds. Tissue 
was frozen at -70 °C within two minutes of having been surgically resected. Human 
olfactory epithelium is not a uniform sensory sheet but exhibits an irregular and 
patchy distribution of olfactory sensory neurons within the nasal cavity (Morrison, 
E.E. & Costanzo, R.M. J. Comp. Neurol. 297, 1-13, 1990; Morrison, E.E. & 

30 Costanzo, R.M. Microsc. Res. Tech. 23, 49-61 , 1992). The presence of olfactory 
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epithelium within some biopsies was verified with immunohistochemistry for neuron- 
specific enolase. RNA as extracted from the snap-frozen biopsies using standard 
protocols. For 5 ! RACE, reverse transcription and further cycles of amplification were 
carried out with the SMART kit (Clontech) using reverse nested primers, on RNA 

5 isolated from olfactory mucosa. 

Our results indicate, contrary to preliminary data (Dulac, C. & Axel, 
R. Cell 83, 195-206, 1995), that the human genome contains at least one VIR-like 
gene that is not a pseudogene; furthermore, that this gene is transcribed into a spliced 
mRNA within cells of the olfactory mucosa of adults; and finally, that the transcript 

10 has the potential to encode a seven-transmembrane protein homologous to putative 
pheromone receptors of rodents. 

********* 

The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art 
from the foregoing description and the accompanying figures. Such modifications are 
intended to fall within the scope of the appended claims. 

It is further to be understood that all values, are approximate, and are 
provided for description. 

All patents, patent applications, publications, and other materials cited 
herein are hereby incorporated herein reference in their entireties. 
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WE CLAIM : 

1 1 . An isolated human vomeronasal-like receptor. 

1 2. The receptor according to claim 1 which comprises an amino „ 

2 acid sequence having greater than about 90% amino acid sequence identity to the 

3 amino acid sequence depicted in SEQ ID NO:2 or SEQ ID NO:4. 

1 3. The receptor according to claim 2 comprising a modification of 

2 the amino acid sequence selected from the group consisting of Ser201 -> Phe, Ala229 

3 -+ Asp, and both. 

1 4. The receptor according to claim 1 which is encoded by a DNA 

2 molecule having a nucleotide sequence as depicted in SEQ ID NO:l or SEQ ID NO:3, 

3 or SEQ ID NO: 17. 

1 5. An antigenic fragment of the receptor of claim 1 . 

1 6. A chimeric polypeptide comprising an amino acid sequence of 

2 a human vomeronasal-like receptor fused to a heterologous amino acid sequence. 

1 7. The chimeric polypeptide of claim 6, wherein the amino acid 

2 sequence having functional activity is selected from a group consisting of a signal 

3 peptide, an antibody tag, an expression tag, a chromatographic tag, a cytoplasmic 

4 signal domain, and a G-protein binding domain. 

1 8. A nucleic acid encoding the polypeptide of claim 6. 



1 



9. 



A nucleic acid encoding the polypeptide of claim 7. 
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1 10. A nucleic acid encoding a human vomeronasal-like receptor. 

1 11. The nucleic acid of claim 10 which hybridizes under high 

2 stringency conditions to a nucleic acid having a sequence corresponding to or 

3 complementary to a nucleic acid sequence as depicted in SEQ ID NO: 1 or SEQ ID 

4 NO:3. 

1 12. The nucleic acid of claim 1 1 which encodes a protein 

2 comprising an amino acid sequence as depicted in SEQ ID NO:2 or SEQ ID NO:4. 

1 13. The nucleic acid of claim 1 0 which comprises a nucleotide 

2 sequence as depicted in SEQ ID NO: 1 or SEQ ID NO:3. 

1 14. An isolated nucleic acid comprising a nucleotide sequence 

2 corresponding to or complementary to at least a ten base length of a nucleotide 

3 sequence as depicted in SEQ ID NO: 1 or SEQ ID NO:3. 

1 15. The nucleic acid of claim 14 which is single stranded. 

1 16. The nucleic acid of claim 1 5 which is selected from the group 

2 consisting of SEQ ID NOS:5-14. 

1 17. The nucleic acid of claim 15 which is labeled. 

1 18. The nucleic acid of claim 1 5 which hybridizes under 

2 intracellular conditions to an mRNA encoding a human vomeronasal-like receptor. 

1 1 9. A vector comprising the nucleic acid encoding a human 

2 vomeronasal-like receptor of claim 1 0. 
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1 20. The vector of claim 19 wherein the nucleic acid encoding a 

2 human vomeronasal-iike receptor is operatively associated with an expression control 

3 sequence that permits expression of the receptor in a host cell. 

1 2 1 . A host cell comprising the vector of claim 20. 

1 22. A method for producing a human vomeronasal-like receptor, 

2 which method comprises culturing the host cell of claim 2 1 under conditions that 

3 permit expression of the receptor. 

1 23 . An isolated host cell that expresses a human vomeronasal-like 

2 receptor, with the proviso that the cell is not a human cell that endogenously express 

3 the receptor. 

1 24. A non-human animal that expresses a human vomeronasal-like 

2 receptor. 

1 25. An antibody that specifically binds to a human 

2 vomeronasal-like receptor. 

1 26. A method of identifying a compound that binds to the receptor 

2 of claim 1, which method comprises detecting association of a candidate compound 

3 with the receptor when the compound and the receptor, wherein detection of such 

4 association indicates that the compound binds to the receptor. 

1 27. The method according to claim 26. wherein the compound is 

2 labeled. 

1 28. The method according to claim 26, wherein the receptor is 

2 present in a membrane of a cell. 
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1 29. The method according to claim 26, wherein the receptor is 

2 present free of cellular components. 

1 30. The method according to claim 26, wherein the compound that 

2 binds to the receptor modulates receptor signaling. 

1 3 1 . A method for detecting a compound that agonizes the receptor 

2 of claim 1, which method comprises detecting G-protein activation in a cell that 

3 expresses the receptor when the cell is contacted with a compound that binds to the 

4 receptor. 

1 32. A method for detecting expression of a human 

2 vomeronasal-like receptor in a cell, which method comprises detecting the presence of 

3 mRNA encoding the receptor in the cell. 

1 33. A method for detecting expression of a human 

2 vomeronasal-like receptor in a cell, which method comprises detecting the presence of 

3 a human vomeronasal-like receptor in the cell. 

1 34. A method for identifying an allelic variant of a gene encoding a 

2 human vomeronasal-like receptor, which method comprises detecting a polymorphism 

3 in a gene encoding a human vomeronasal-like receptor when compared to a sequence 

4 of a gene encoding a human vomeronasal-like receptor as depicted in SEQ ID NO: 1 . 

1 35. The method according to claim 34, wherein the polymorphism 

2 is detected by sequencing. 



1 



36. 



An isolated primate vomeronasal-like receptor. 
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1 37. The receptor according to claim 36 which comprises an amino 

2 acid sequence having greater than about 90% amino acid sequence identity to the 

3 amino acid sequence depicted in SEQ ID NO:2 or SEQ ID NO:4. 

1 38. The receptor according to claim 37 comprising a modification 

2 of the amino acid sequence selected from the group consisting of Ser201 -* Phe, 

3 Ala229 -* Asp, and both. 

1 39, The receptor according to claim 36 which is encoded by a DNA 

2 molecule having a nucleotide sequence as depicted in SEQ ID NO:l or SEQ ID NO:3. 

1 40. A nucleic acid encoding a primate vomeronasal-like receptor. 

1 41. The nucleic acid of claim 40 which hybridizes under high 

2 stringency conditions to a nucleic acid having a sequence corresponding to or 

3 complementary to a nucleic acid sequence as depicted in SEQ ID NO: 1 or SEQ ID 

4 NO:3. 

1 42. The nucleic acid of claim 41 which encodes a protein 

2 comprising an amino acid sequence as depicted in SEQ ID NO:2 or SEQ ID NO:4. 

1 43 . The nucleic acid of claim 40 which comprises a nucleotide 

2 sequence as depicted in SEQ ID NO: 1 or SEQ ID NO:3. 
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FIG. I 

1/1 31/11 

ata act ttt gga aaa gta aaa tea ggg att age ttc etc att cag act gga gtt ggg ate 
M A FGKVKSGIS FLIQTGVGI 
61/21 91/31 

ctg gga aat tec ttt etc ctt tgt ttt tat aac tta att ttg ttc act gga cac aag ctg 
L G NSF LLCFYNLILFTGHKL 
121/41 151/51 

aqa ccc acg gac ttg att etc age caa ctg gee ttg get aac tec atg gtc ctt ttc ttt 
RPTDL! LSQLALANSMVLFF 
181/61 211/71. 

aaa ggg ata cct cag aca atg gca get ttt gga ttg aaa tat ttg ctg aat gac act gga 
KGIPQTMAAFGLKYLLNDTG 

241/81 271/91 

tgt aag ttt gtc ttt tat tat cac agg gtg ggc aca aga gtt tec etc age acc ate tgc 
CKFVFYYHR VGTRVSLST IC 

301/101 331/11 1 

ctt etc aat gga ttc caa gee att aag etc aac ccc agt ata tgc agg tgg atg gag ate 
LLNGFQAIKL N PSICRWMEI 
361/121 391/131 

aaq att aga tec cca agg ttt att gac ttc tgt tgt etc etc tgc tgg gec ccc cat gtc 
K I RSPRFIDFCCLLCWAPHV 

421/141 451/151 

ttq atg aat gca tct gtt ctt eta tta gtg aat ggc cca ctg aat age aaa aac agt agt 
LMNASVLLLVNGPLNSKNSS 

481/161 511/171 

gca aaa aac aat tat gga tac tgt tct tac aaa gca tea aag aga ttt age tea tta cat 
A KNNYG YCSYKA SKRFSS LH 
541/181 571/191 

gca gtc tta tat ttt tec cct gat ttt atg agt ttg ggc ttc atg gtc tgg gee agt ggc 
AVLYFSPDFMSLGFMVWASG 

601/201 631/211 

tec atq gtc ttc ttc etc tac aga cac aag cag caa gtc caa cac aat cac age aac aga 
SM VFFLYRHKQQVQHNHSNR 

661/221 691/231 

etc tec tgc aga cct tec cag gaa gee aga gec aca cac acc ate atg gtc ctg gtg age 
LSCRPSQEARATHT I MVLVS 
721/241 751/251 

tec ttt ttt gtt ttc tat tea gtc cat agt ttt ctg aca att tgg aca act gta gtt gca 
SFFVFYSVHSFLTI WTTVVA 
781/261 811/271 

aac cca ggc cag tgg ata gtg acc aac tct gtg ttg gtc gee tea tgt ttc cca gca cgc 
NPGQWIVTNSVLVASCFPAR 

841/281 " 871/291 

age cct ttt gtc etc ate atg agt gat act cat ate tct cag ttc tgt ttt gee tgc agg 

SPFVLI MSDTHISQFCFACR 

901/301 931/311 

aca agg aaa aca etc ttt cct aat ctg gtt gtc atg cca tga 

TRKTLFPNLVVMP* 

SUBSTITUTE SHEET (RULE 26) 
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FIG. 2a 



1/1 31/11 

atg gtt gga gac aca tta aaa ctt ctg tct cca ctg atg aca aga tac ttc ttt ctg ctt 
MVGDTLKLLS PLMTRYF FLL 
61/21 91/31 

ttt tat tct act gat tct tea gac etc aat gaa aat caa cat ccc eta gat ttt gat gaa 
FYSTDSSDLNENQHPLDFDE 
121/41 151/51 

atg get ttt gga aaa gta aaa tea ggg att age ttc etc att cag act gga gtt ggg ate 
MAFGKVKSGISFLIQTGVGI 
181/61 211/71 

ctg gga aat tec ttt etc ctt tgt ttt tat aac tta att ttg ttc act gga cac aag ctg 
LGNSFLLCFYNLILFTGHKL 
241/81 271/91 

aga ccc acg gac ttg att etc age caa ctg gee ttg get aac tec atg gtc ctt ttc ttt 
R PTDLILSQLALANSMVLFF 
301/101 331/111 

aaa ggg ata cct cag aca atg gca get ttt gga ttg aaa tat ttg ctg aat gac act gga 
KG 1 PQTMAAFG LKYL LN DTG 
361/121 391/131 

tgt aag ttt gtc ttt tat tat cac agg gtg ggc aca aga gtt tec etc age ace ate tgc 
CKFVF YYHR VGT RV SLST I C 
421/141 451/151 

ctt etc aat gga ttc caa gec att aag etc aac ccc agt ata tgc agg tgg atg gag ate 
L LNGFQAI KLNPSI CRWMEI 
481/161 511/171 

aag att aga tec cca agg ttt att gac ttc tgt tgt etc etc tgc tgg gee ccc cat gtc 
Kl RSPRFIDFCCLLCWAPHV 
541/181 571/191 

ttg atg aat gca tct gtt ctt eta tta gtg aat ggc cca ctg aat age aaa aac agt agt 
LMNASVLLLVNGPLNSKNS S 
601/201 631/211 

gca aaa aac aat tat gga tac tgt tct tac aaa gca tea aag aga ttt age tea tta cat 
AKNNYGYCSYKASKRFSS LH 
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FIG. 2b 



A A 

661/221 691/231 

gca gtc tta tat ttt tec cct gat ttt atg agt ttg ggc ttc atg gtc tgg gec agt ggc 
AVLYFSPDFMSLGFMVWASG 
721/241 751/251 

tec atg gtc ttc ttc etc tac aga cac aag cag caa gtc caa cac aat cac age aac aga 
SMVFFLYRHKQQVQHNHSNR 
781/261 811/271 

etc tec tgc aga cct tec cag gaa gee aga gee aca cac ace ate atg gtc ctg gtg age 
LSCRPSQEARATHTIMVLVS 
841/281 871/291 

tec ttt ttt gtt ttc tat tea gtc cat agt ttt ctg aca att tgg aca act gta gtt gca 
SFFVFYSVHSFLT I WTTVVA 
901/301 931/311 

aac cca ggc cag tgg ata gtg ace aac tct gtg ttg gtc gee tea tgt ttc cca gca cgc 
NP6QWIVTNSVLVASCFPAR 
961/321 991/331 

age cct ttt gtc etc ate atg agt gat act cat ate tct cag ttc tgt ttt gee tgc agg 

S PFVLI MSDTHI SQFCFACR 

1021/341 1051/351 

aca agg aaa aca etc ttt cct aat ctg gtt gtc atg cca tga 

TR KTLFPNLVVMP* 
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